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POLYNUCLEOTIDES FOR USE AS TAGS AND TAG COMPLEMENTS, 
MANUFACTURE AND USE THEREOF 

FIELD OF THE INVENTION 

5 This invention relates to families of oligonucleotide tags for use, for 

example, in sorting molecules. Members of a given family of tags can be 
distinguished one from the other by specific hybridization to their tag 
complements. 

10 BACKGROUND OF THE INVENTION 

Specific hybridization of oligonucleotides and their analogs is a 
fundamental process that is employed in a wide variety of research, medical, 
and industrial applications, including the identification of disease-related 
polynucleotides in diagnostic assays, screening for clones of novel target 

15 polynucleotides, identification of specific polynucleotides in blots of 

mixtures of polynucleotides, therapeutic blocking of inappropriately expressed 
genes and DNA sequencing. Sequence specific hybridization is critical in the 
development of high' throughput multiplexed nucleic acid assays. As formats 
for these assays expand to encompass larger amounts of sequence information 

20 acquired through projects such as the Human Genome project, the challenge of 
sequence specific hybridization with high fidelity is becoming increasingly 
difficult to achieve. 

In large part, the success of hybridization using oligonucleotides 
depends on minimizing the number of false positives and false negatives. Such 

25 problems. have made the simultaneous use of multiple hybridization probes in a 
single experiment i.e. multiplexing, particularly in the analysis of multiple 
gene sequences on a gene microarray, very difficult. For example, in certain 
binding assays, a number of nucleic acid molecules are bound to a chip with 
the desire that a given w target" sequence will bind selectively to its 

30 complement attached to the chip. Approaches have been developed that involve 
the use of oligonucleotide tags attached to a solid support that can be used 
to specifically hybridize to the tag complements that are coupled to probe 
sequences. Chetverin et al. (WO 93/17126) useB sectioned, binary 
oligonucleotide arrays to sort and survey nucleic acids. These arrays have a 

35 constant nucleotide sequence attached to an adjacent variable nucleotide 

sequence, both bound to a solid support by a covalent linking moiety. These 
binary arrays have advantages compared with ordinary arrays in that they can 
be used to sort strands according to their terminal sequences so that each 
strand binds to a fixed location on an array. The design of the terminal 

40 sequences in this approach comprises the use of constant and variable 
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sequences. United States Patent Nos. 6,103,463 and 6,322,971 issued to 
Chetverin et al. on August 15, 2000 and November 27, 2001, respectively. 

This concept of using molecular tags to sort a mixture of molecules is 
analogous to molecular tags developed for bacterial and yeast genetics (Hensel 
et al., Science; 269, 400-403: 1995 and Schoemaker et al . , Nature Genetics; 
14, 450-456: 1996). Here, a method termed "signature tagged" mutagenesis in 
which each mutant is tagged with a different DNA sequence is used to recover 
mutant genes from a complex mixture of approximately 10,000 bacterial 
colonies. In the tagging approach of Barany et al . (WO 9731256), known as 
the "zip chip", a family of nucleic acid molecules, the "zip-code addresses", 
each different from each other, are set out on a grid. Target molecules are 
attached to oligonucleotide sequences complementary to the "zipcode 
addresses," referred to as "zipcodes," which are used to specifically 
hybridize to the address locations on the grid. While the selection of these 
families of polynucleotide sequences used as addresses is critical for correct 
performance of the assay, the performance has not been described. 

Working in a highly parallel hybridization environment requiring 
specific hybridization imposes very rigorous selection criteria for the design 
of families of oligonucleotides that are to be used. The success of these 
approaches is dependent on the specific hybridization of a probe and its 
complement. Problems arise as the family of nucleic acid molecules cross- 
hybridize or hybridize incorrectly to the target sequences. While it is 
common to obtain incorrect hybridization resulting in false positives or an 
inability to form hybrids resulting in -false negatives, the frequency of such 
results must be minimized. . In order to achieve this goal certain 
thermodynamic properties of forming nucleic acid hybrids must be considered. 
The temperature at which oligonucleotides form duplexes with their 
complementary sequences known as the T m (the temperature at which 50% of the 
nucleic acid duplex is dissociated) varies according to a number of sequence 
dependent properties including the hydrogen bonding energies of the canonical 
pairs A-T and G-C (reflected in .GC or base composition) , stacking free energy 
and, to a lesser extent, nearest neighbour interactions. These energies vary 
widely among oligonucleotides that are typically used in hybridization assays. 
For example, hybridization of two probe sequences composed of 24 nucleotides, 
one with a 40% GC content and the other with a 60% GC content, with its 
complementary target under standard conditions theoretically may have a 10 °C 
difference in melting temperature (Mueller et al., Current Protocols in Mol. 
Biol.; 15, 5:1993). Problems in hybridization occur when the hybrids are 
allowed to form under hybridization conditions that include a single 
hybridization temperature that is not optimal for correct hybridization of all 
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oligonucleotide sequences of a set. Mismatch hybridization of non- 
complementary probes can occur forming duplexes with measurable mismatch 
stability (Santalucia et al., Biochemistry; 38: 3468-77, 1999). Mismatching 
of duplexes in a particular set of oligonucleotides can occur under 
hybridization conditions where the mismatch results in a decrease in duplex 
stability that results in a higher T ra than the least stable correct duplex of 
that particular set. For example, if hybridization is carried out under 
conditions that favor the AT-rich perfect match duplex sequence, the 
possibility exists for hybridizing a GC-rich duplex sequence that contains a 
mismatched base having a melting temperature that is still above the correctly 
formed AT-rich duplex. Therefore, design of families of oligonucleotide 
sequences that can be used in multiplexed hybridization reactions must include 
consideration for the thermodynamic properties of oligonucleotides and duplex 
formation that will reduce or eliminate cross hybridization behavior within 
the designed oligonucleotide set. 

The development of such families of tags has been attempted over the 
years with varying degrees of 'success. There are a number of different 
approaches for selecting sequences for use in multiplexed hybridization 
assays. The selection of sequences that can be used as zipcodes or tags in an 
addressable array has been described in the patent literature in an approach 
taken by Brenner and co-workers. United States Patent No. 5,654,413 describes 
a population of oligonucleotide tags (and corresponding tag complements) in 
which each oligonucleotide tag includes a plurality of subunits, each subunit 
consisting of an oligonucleotide having a length of from three to six 
nucleotides and each subunit being selected from a minimally cross hybridizing 
set, wherein a subunit of the set would have at least two mismatches with any 
other sequence of the set. Table II of the Brenner patent specification 
describes exemplary groups of 4mer subunits that are minimally cross 
hybridizing according to the aforementioned criteria. In the approach taken 
by Brenner, constructing non cross -hybridizing oligonucleotides, relies on the 
use of subunits that form a duplex having at least two mismatches with the 
complement of any other subunit of the same set. The ordering of subunits in 
the construction of oligonucleotide tags is not specifically defined. 

Parameters used in the design of tags based on subunits are discussed in 
Barany et al. (WO 9731256) . For example, in the design of polynucleotide 
sequences that are for example 24 nucleotides in length (24mer) derived from a 
set of four possible tetramers in which each 24mer "address" differs from its 
nearest 24mer neighbour by 3 tetramers. They discuss further that, if each 
tetramer differs from each other by at least two nucleotides, then each 24mer 
will differ from the next by at least six nucleotides. This is determined 
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without consideration for insertions or deletions when forming the alignment 
between any two sequences of the set.- In this way a unique "zip code" 
sequence is generated. The zip code is ligated to a label in a target 
dependent manner, resulting in a unique "zip code" which is then allowed to 
hybridize to its address on the chip. To minimize cross -hybridization of a 
"zip code" 'to other "addresses", the hybridization reaction is carried out at 
temperatures of 75-80°C. Due to the high temperature conditions for 
hybridization, 24mers that have partial homology hybridize to a lesser extent 
than sequences with perfect complementarity and represent "dead zones'-. This 
approach of implementing stringent hybridization conditions for example, 
involving high temperature hybridization, is also practiced by Brenner et. al. 

The current state of technology for designing non-cross hybridizing tags 
based on subunits does not provide sufficient guidance to construct a family 
of relatively large numbers of sequences with practical value in assays that 
require stringent non-cross hybridizing behavior. 

A multiplex sequencing method has been described in United States 
Patent No. 4,942,124, which issued to Church on July 17, 1990. The 
method requires at least two vectors which differ from each other at a 
tag sequence. It is stated that a tag sequence in one vector will not 
hybridize under stringent hybridization conditions to a tag sequence 
(i.e., complementary probes do not cross -hybridize) in another vector. 
Exemplary stringent hybridization conditions are given as 42°C in 500- 
1000 mM sodium phosphate buffer. A set of 42 20-mer tag sequences, all 
of which lack G residues, is given in Figure 3 of the specification. 
Details of how the sequences were obtained are not provided, although 
Church states that initially 92 were chosen on the basis of their having 
sufficient sequence diversity to insure uniqueness. 

So while it is possible for a person knowledgeable in the field to 
design a small number of non-cross hybridizing tags, it is difficult to design 
a larger number such tags. A co-pending application of the owner of this 
patent application describes such a set of 210 non-cross hybridizing tags that 
have a practical value. A method described in international patent 
application No. PCT/CA 01/00141 published under WO 01/59151 on August 16, 
2001. Little guidance is provided, however, for the provision of a larger 
set, say 1000 or so, of non-cross hybridizing tags. Since having sets of 
approximately 1000 non- cross hybridizing tags, or more, would be of 
considerable practical value, it would be useful to develop such a. set. 

Thus, while it is desirable with such arrays to have, at once, a large 
number of address molecules, the address molecules should each be highly 
selective for its own complement sequence:" While such an array provides the 
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advantage that the family of molecules making up' the grid is entirely of 
design, and does not rely on sequences as they occur in nature, the provision 
of a family of molecules., which is sufficiently large and where each 
individual member is sufficiently selective for its complement over all the 
other zipcode molecules (i.e., where there is sufficiently low cross- 
hybridization, or cross-talk) continues to elude researchers. 



SUMMARY OF INVENTION 

A family of 1168 sequences was obtained using a computer algorithm to 
have desirable hybridization properties for use in nucleic acid detection 
assays. The sequence set of 1168 oligonucleotides was partially characterized 
in hybridization assays, demonstrating the ability of family members to 
correctly hybridize to their complementary sequences with minimal cross 
hybridization. These are the sequences having SEQ ID NOs : 1 to 1168 of Table 
I, 

Variant families of sequences (seen as tags or tag complements) of 
a family of sequences taken from Table I are also part of the invention. 
For the purposes of discussion, a family or set of oligonucleotides will 
often be described as a family of tag complements, but it will be 
understood that such a set could just easily be a family of tags. 

A family of complements is obtained from a set of oligonucleotides 
based on a family of oligonucleotides such as those of Table I. To 
simplify discussion, providing a family of complements based on the 
oligonucleotides of Table I will be described. 

Firstly, the groups of sequences based on the oligonucleotides of 
.Table I can be represented as shown in Table IA. 

Table IA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 
Identifier 
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Table XA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 
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2 


52 


2 


3 


2 


3 


2 


2 


2 


1 


3 


1 


1 


2 


2 


2 


1 


3 


2 


1 


2 


3 


2 


3 


2 


1 


53 


3 


1 


2 


1 


1 


2 


3 


1 


2 


2 


1 


2 


1 


3 


1 


1 


1 


3 


2 


3 


2 


2 


2 


3 


54 


3 


2 


2 


1 


2 


2 


2 


3 


2 


1 


1 


3 


2 


2 


1 


1 


3 


1 


2 


1 


3 


2 


1 


3 


55 


1 


3 


2 


2 


2 


1 


2 


2 


3 


1 


1 


1 


3 


1 


3 


2 


2 


2 


3 


1 


1 


2 


1 


3 


56 


2 


2 


3 


2 


3 


2 


2 


2 


1 


2 


2 


3 


2 


3 


2 


1 


3 


2 


2 


2 


1 


1 


1 


3 


57 


1 


2 


2 


3 


2 


3 


1 


3 


1 


1 


3 


1 


2 


1 


2 


3 


1 


1 


1 


3 


2 


2 


1 


2 


58 


2 


3 


1 


3 


1 


1 


2 


3 


2 


1 


1 


1 


3 


1 


1 


2 


3 


2 


2 


2 


1 


2 


2 


3 


59 


1 


2 


3 


2 


3 


1 


1 


1 


3 


2 


2 


1 


2 


3 


1 


2 


3 


2 


2 


1 


1 


2 


2 


3 


60 


3 


2 


2 


2 


1 


3 


2 


1 


2 


2 


1 


3 


2 


2 


3 


2 


2 


1 


1 


3 


i 


2 


2 


3 


61 


3 


1 


2 


2 


3 


1 


2 


1 


2 


2 


2 


3 


1 


1 


2 


3 


2 


2 


2 


3 


2 


2 


2 


3 


' 62 


2 


3 


1 


1 


2 


2 


3 


1 


1 


1 


3 


2 


3 


2 


1 


1 


2 


3 


2 


2 


3 


2 


1 


2 


63 


3 


1 


2 


2 


3 


2 


1 


2 


2 


3 


2 


2 


3 


1 


3 


1 


1 


2 


1 


3 


1 


1 


2 


1 


64 


1 


1 


1 


2 


2 


2 


3 


1 


3 


1 


2 


2 


2 


3 


2 


3 


1 


2 


1 


3 


1 


3 


2 


1 


65 


3 


2 


1 


1 


2 


2 


1 


3 


1 


2 


2 


2 


3 


2 


2 


2 


3 


2 


2 


3 


2 


2 


3 


.2 


66 


3 


2 


2 


2 


3 


2 


1 


2 


2 


3 


2 


2 


1 


3 


2 


3. 


1 


1 


2 


1 


2 


1 


3 


2 


67 


1" 


2 


3 


2 


1 


3 


2 


1 


3 


2 


1 


3 


1 


2 


3 


2 


2 


2 


1 


2 


3 


1 


1 


2 


68 


2 


3 


2 


'2 


2 


1 


1 


1 


3 


1 


2 


3 


1 


2 


2 


3 


1 


1 


3 


1 


1 


1 


2 


3 


69 


2 


3 


2 


3 


1 


2 


1 


1 


2 


3 


1 


2 


3 


2 


2 


1 


2 


2 


2 


3 


2 


3 


2 


1 


70 


1 


2 


1 


3 


2 


2 


3 


2 


3 


1 


3 


1 


1 


2 


2 


2 


3 


2 


1 


1 


2 


2 


1 


3 


71 


1 


2 


1 


3 


1 


2 


3 


2 


1 


1 


3 


1 


3 


1 


1 


1 


2 


2 


3 


2 


3 


1 


1 


1 


72 
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Table IA? Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 

Identifier 



1 


3 


1 


2 


2 


1 


1 


3 


1 


3 


1 


1 


3 


2 


2 


1 


1 


2 


1 


3 


1 


3 


2 


1 


73 


3 


1 


1 


3 


2 


1 


1 


1 


2 


2 


3 


2 


3 


1 


1 


2 


3 


1 


1 


1 


3 


1 


1 


1 


74 


1 


1 


2 


3 


2 


1 


1 


3 


1 


1 


1 


3 


1 


1 


3 


1 


2 


2 


3 


2 


2 


3 


2 


1 


75 


2 


2 


2 


3 


1 


2 


2 


2 


1 


2 


3 


2 


3 


2 


2 


1 


2 


3 


2 


2 


3 


1 


3 


2 


76 


3 


2 


1 


2 


2 


3 


1 


3 


1 


1 


1 


2 


2 


2 


3 


1 


1 


3 


1 


1 


2 


3 


1 


1 


77 


3 


1 


1 


2 


2 


3 


2 


1 


2 


3 


1 


1 


1 


2 


3 


1 


1 


2 


2 


3 


2 


1 


1 


3 


78 


2 


1 


2 


2 


3 


2 


1 


3 


1 


1 


3 


2 


1 


1 


1 


3 


2 


2 


1 


3 


1 


1 


3 


2 


79 


2 


2 


2 


1 


2 


3 


2 


1 


1 


2 


3 


1 


2 


1 


1 


3 


2 


3 


2 


1 


3 


2 


2 


3 


80 


1 


2 


1 


2 


1 


3 


2 


2 


3 


1 


1 


1 


2 


2 


3 


2 


3 


1 


2 


1 


3 


2 


3 


2 


81 


1 


2 


1 


1 


3 


1 


1 


1 


2 


2 


1 


3 


1 


3 


1 


3 


2 


2 


3 


2 


1 


1 


1 


3 


82 


3 


1 


1 


2 


2 


3 


2 


3 


1 


1 


1 


2 


3 


2 


3 


1 


2 


2 


3 


1 


2 


1 


2 


1 


83 


1 


1 


1 


2 


1 


1 


3 


2 


1 


3 


2 


2 


2 


1 


1 


2 


3 


1 


3 


1 


3 


1 


1 


3 


84 


3 


1 


2 


2 


1 


1 


1 


3 


1 


1 


3 


2 


1 


1 


3 


2 


3 


1 


1 


2 


3 


2 


2 


2 


85 


2 


1 


2 


3 


2 


3 


2 


3 


2 


2 


3 


2 


2 


2 


1 


3 


2 


3 


2 


2 


1 


2 


2 


1 


86 


3 


1 


3 


2 


2 


1 


2 


1 


2 


3 


2 


1 


3 


2 


2 


1 


3 


1 


3 


2 


2 


1 


2 


1 


87 


3 


1 


1 


1 


3 


1 


1 


1 


3 


1 


1 


3 


2 


3 


2 


2 


1 


1 


3 


2 


2 


1 


1 


1 


88 


2 


1 


3 


2 


1 


2 


2 


1 


3 


2 


1 


1 


3 


2 


1 


2 


3 


2 


3 


1 


2 


2 


3 


2 


89 


2 


2 


3 


2 


3 


2 


3 


1 


2 


2 


3 


1 


1 


2 


1 


2 


2 


3 


2 


3 


1 


1 


1 


2 


90 


1 


2 


3 


2 


3 


1 


1 


1 


3 


1 


3 


2 


2 


1 


1 


3 


2 


3 


1 


2 


2 


1 


1 


1 


91 


3 


1 


2 


2 


3 


1 


1 


2 


3 


1 


2 


2 


3 


1 


3 


1 


2 


1 


2 


3 


2 


1 


1 


1 


92 


1 


1 


3 


1 


2 


3 


1 


2 


1 


3 


2 


2 


1 


1 


3 


2 


3 


2 


1 


1 


3 


2 


2 


1 


93 


2 


1 


3 


2 


2 


3 


2 


2 


1 


2 


2 


3 


1 


3 


1 


1 


2 


2 


2 


1 


3 


1 


1 


3 


• 94 


2 


2 


2 


1 


2 


1 


3 


2 


3 


1 


1 


2 


2 


1 


2 


3 


1 


3 


2 


3 


1 


1 


1 


3 


95 


3 


1 


2 


1 


3 


1 


2 


2 


2 


1 


3 


1 


1 


2 


3 


1 


1 


2 


2 


1 


1 


3 


2 


3 


96 


2 


2 


2 


3 


1 


1 


3 


1 


1 


3 


1 


3 


1 


2 


2 


2 


3 


1 


1 


1 


2 


2 


3 


1 


97 


1 


2 


3 


1 


1 


2 


1 


1 


3 


1 


3 


2 


2 


3 


1 


2 


1 


1 


1 


2 


3 


2 


3 


1 


98 


2 


3 


2 


2 


2 


1 


2 


3 


2 


1 


3 


2 


3 


2 


1 


3 


1 


2 


2 


3 


1 


1 


2 


2 


99 


2 


2 


2 


1 


1 


3 


2 


3 


1 


3 


2 


2 


1 


2 


1 


3 


1 


1 


3 


2 


1 


3 


2 


1 


100 


3 


1 


2 


2 


2 


1 


2 


3 


2 


3 


2 


2 


2 


3 


1 


1 


3 


2 


2 


1 


1 


3 


1 


2 


101 


2 


1 


3 


2 


2 


1 


3 


1 


3 


1 


1 


1 


3 


2 


3 


1 


2 


1 


1 


1 


3 


2 


2 


1 


102 


3 


2 


1 


1 


2 


3 


1 


2 


1 


1 


2 


3 


1 


1 


3 


2 


3 


2 


1 


2 


1 


2 


1 


3 


103- 


1 


1 


2 


3 


1 


1 


3 


2 


3 


2 


2 


1 


3 


2 


1 


2 


1 


3 


1 


2 


1 


3 


2 


1 


104 


2 


1 


1 


1' 


2 


2 


3 


1 


3 


2 


2 


2 


3 


2 


2 


2 


3 


1 


2 


2 


3 


2 


1 


3 


105 


2 


1 


1 


2 


3 


1 


1 


3 


1 


1 


2 


1 


1 


3 


2 


1 


2 


3 


1 


3 


2 


3 


2 


2 


106 


1 


1 


1 


2 


3 


2 


1 


1 


2 


1 


3 


2 


3 


2 


2 


3 


2 


2 


1 


3 


2 


2 


1 


3 


107 


1 


3 


1 


3 


2 


2 


1 


3 


2 


3 


1 


1 


1 


2 


3 


2 


2 


3 


2 


2 


1 


1 


1 


2 


108 


3 


1 


1 


1 


2 


1 


3 


1 


1 


1 


2 


3 


2 


1 


2 


2 


3 


2 


2 


2 


3 


2 


3 


1 


109 


1 


3 


2 


2 


1 


"2 


1 


1 


3 


2 


2 


2 


3 


2 


3 


1 


3 


1 


1 


2 


2 


1 


1 


3 


110 


3 


1 


3 


2 


2 


2 


1 


2 


1 


3 


2 


2 


1 


3 


1 


1 


2 


1 


2 


3 


2 


2 


3 


2 


111 


1 


3 


1 


3 


2 


2 


1 


2 


2 


1 


3 


1 


1 


3 


1 


1 


3 


1 


2 


2 


2 


1 


1 


3 


112 


3 


1 


3 


2 


2 


1 


1 


2 


3 


1 


1 


1 


2 


1 


1 


3 


2 


1 


2 


2 


2 


3 


2 


3 


113 


1 


2 


3 


1 


2 


3 


1 


1 


2 


1 


3 


2 


2 


3 


1 


1 


3 


2 


1 


2 


1 


2 


1 


3 


114 


1 


2 


1 


3 


1 


2 


1 


2 


3 


1 


3 


1 


2 


3 


1 


1 


1 


3 


2 


2 


1 


3 


2 


1 


115 


2 


1 


2 


3 


2 


1 


1 


1 


3 


1 


1 


1 


3 


2 


3 


1 


1 


1 


3 


1 


1 


3 


1 


1 


116 


2 


3 


X 


1 


2 


3 


2 


1 


3 


1 


1 


1 


2 


3 


1 


1 


2 


3 


2 


2 


3 


1 


1 


1 


117 


1 


1 


2 


2 


3 


1 


1 


2 


1 


3 


2 


3 


2 


3 


2 


3 


1 


3 


2 


2 


2 


1 


1 


2 


118 


1 


3 


1 


2 


1 


2 


2 


3 


2 


2 


2 


3 


1 


2 


2 


1 


1 


2 


3 


1 


1 


3 


1 


3 


119 


1 


1 


1 


3 


2 


2 


3 


2 


1 


1 


1 


3 


2 


2 


3 


1 


1 


3 


1 


2 


1 


1 


1 


3 


120 


3 


2 


2 


1 


1 


3 


1 


3 


1 


2 


2 


1 


2 


3 


1 


3 


1 


2 


3 


2 


1 


2 


2 


1 


121 


1 


3 


1 


1 


3 


1 


2 


1 


2 


1 


1 


3 


1 


1 


3 


1 


2 


2 


3 


1 


1 


2 


2 


3 


122 


3 


2 


1 


3 


1 


1 


1 


2 


2 


2 


3 


1 


1 


2 


2 


3 


1 


2 


3 


2 


3 


1 


1 


1 


123 


1 


1 


3 


1 


3 


2 


1 


3 


1 


2 


2 


3 


1. 


2 


1 


1 


3 


2 


1 


2 


1 


2 


3 


1 


124 


2 


3 


1 


2 


1 


2 


1 


3 


2 


1 


3 


2 


3 


1 


1 


3 


1 


1 


1 


2 


1 


1 


3 


2 


125 


1 


3 


1 


2 


1 


1 


2 


3 


1 


2 


3 


1 


3 


1 


1 


1 


2 


3 


1 


1 


3 


1 


2 


1 


126 


1 


2 


3 


2 


3 


1 


1 


1 


3 


2 


1 


2 


2 


2 


3 


2 


3 


1 


2 


1 


2 


1 


3 


2 


127 


1- 


1 


2 


1 


1 


3 


1 


3 


1 


1 


2 


2 


3 


1 


2 


1 


2 


3 


1 


1 


3 


1 


2 


3 


128 
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Table XA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 

Identifier 



2 


1 


1 


3 


2 


3 


2 


1 


2 


2 


2 


1 


3 


2 


1 


3 


1 


1 


2 


3 


1 


1 


3 


2 


129 


2 


1 


2 


3 


2 


2 


1 


3 


1 


2 


2 


2 


3 


2 


2 


3 


1 


3 


1 


2 


2 


3 


1 


2 


130 


1 


3 


2 


2 


2 


3 


2 


1 


2 


3 


1 


1 


3 


1 


3 


1 


2 


1 


3 


2 


1 


2 


2 


2, 


131 


3 


1 


3 


1 


1 


1 


2 


3 


2 


2 


1 


2 


3 


2 


1 


2 


2 


2 


1 


3 


2 


1 


3 


2 


132 


2 


1 


2 


3 


2 


3 


1 


3 


1 


1 


2 


3 


2 


3 


2 


2 


2 


3 


1 


2 


2 


2 


1 


1 


133 


3 


2 


1 


2 


3 


2 


2 


2 


3 


2 


2 


2 


1 


2 


1 


3 


1 


1 


2 


3 


2 


1 


2 


3 


134 


3 


1 


3 


2 


1 


2 


1 


2 


1 


3 


1 


1 


3 


1 


1 


1 


3 


1 


1 


1 


2 


2 


2 


3 


135 


1 


2 


3 


1 


3 


2 


3 


1 


1 


3 


2 


1 


1 


1 


2 


3 


2 


1 


3 


2 


2 


1 


2 


2 


136 


2 


2 


1 


1 


3 


1 -1 


3 


2 


3 


1 


3 


2 


2 


1 


2 


2 


3 


2 


3 


1 


2 


1 


2 


137 


1 


2 


3 


1 


1 


1 


2 


3 


1 


3 


1 


1 


2 


1 


2 


2 


3 


2 


2 


3 


2 


2 


2 


3 


13 8 


3 


1 


2 


2 


1 


1 


2 


3 


1 


2 


2 


1 


2 


3 


2 


3 


1 


1 


2 


2 


3 


1 


2 


3 


139 


3 


1 


1 


1 


2 


3 


2 


2 


1 


1 


1 


3 


1 


2 


1 


2 


3 


1 


1 


1 


3 


2 


1 


3 


140 


2 


1 


2 


2 


3 


2 


2 


3 


1 


2 


2 


2 


3 


1 


2 


1 


2 


2 


1 


3 


2 


3 


2 


3 


141 


2 


2 


2 


1 


2 


3 


2 


2 


2 


3 


2 


3 


2 


1 


2 


3 


2 


1 


1 


3 


2 


1 


3 


2 


142 


1 


1 


2 


2 


3 


1 


1 


1 


3 


1 


1 


2 


2 


3 


2 


3 


2 


3 


1 


1 


2 


2 


3 


1 


143 


2 


3 


1 


3 


2 


2 


2 


3 


1 


.1 


2 


2 


2 


3 


2 


2 


2 


3 


1 


3 


2 


1 


1 


2 


144 


3 


1 


2 


3 


2 


1 


2 


1 


1 


2 


3 


1 


2 


3 


2 


3 


2 


3 


2 


1 


1 


1 


2 


2 


145 


1 


2 


3 


2 


3 


1 


3 


1 


3 


1 


1 


3 


1 


1 


2 


2 


2 


3 


2 


2 


2 


1 


2 


2 


146 


3 


2 


3 


1 


2 


1 


1 


1 


3 


2 


1 


2 


2 


3 


2 


2 


3 


1 


2 


1 


3 


1 


1 


1 


147 


3 


1 


1 


3 


2 


1 


3 


1 


1 


2 


1 


3 


1 


1 


1 


3 


2 


2 


1 


1 


2 


1 


3 


1 


148 


2 


2 


3 


2 


3 


2 


1 


3 


2 


2 


1 


1 


3 


1 


3 


2 


2 


3 


2 


2 


2 


1 


1 


2 


149 


2 


1 


3 


2 


1 


3 


2 


1 


1 


3 


2 


2 


3 


2 


2 


1 


3 


1 


1 


2 


1 


3 


2 


2 


150 


1 


1 


2 


2 


2 


3 


1 


1 


3 


2 


1 


2 


1 


1 


2 


3 


1 


1 


2 


3 


2 


3 


2 


3 


151 


2 


1 


3 


1 


1 


1 


2 


2 


3 


2 


1 


3 


2 


1 


2 


2 


2 


3 


1 


3 


1 


3 


1 


1 


152 


2 


3 


2 


1 


2 


1 


2 


3 


2 


2 


1 


1 


2 


3 


1 


3 


1 


2 


3 


2 


2 


3 


2 


1 


153 


2 


1 


2 


2 


2 


3 


1 


2 


1 


1 


3 


1 


3 


1 


1 


2 


3 


1 


1 


3 


1 


1 


3 


2 


154 


2 


2 


3 


1 


1 


2 


1 


3 


2 


3 


2 


1 


1 


2 


3 


1 


1 


2 


1 


2 


3 


1 


2 


3 


155 


3 


2 


1 


3 


2 


2 


2 


3 


2 


3 


1 


1 


2 


1 


3 


1 


1 


2 


2 


1 


3 


2 


2 


2 


156 


1 


1 


1 


3 


1 


2 


3 


1 


2 


2 


3 


2 


1 


1 


2 


2 


2 


3 


2 


3 


2 


3 


1 


1 


157 


3 


1 


1 


3 


1 


2 


2 


3 


2 


2 
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3 


2 


2 


1 
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2 
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2 


1 


1 
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2 
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1 


3 


1 


1 


3 


2 


3 


2 
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2 


2 


1 


1 


3 


2 


1 


1 


3 


2 


2 


1 


161 


2 


1 


2 


3 


1 


1 


2 


2 


1 


2 


3 


1 


3 


1 


1 


3 


1 


1 


2 


1 


3 


1 


3 


2 


162 


2 


2 


2 


3 


2 


2 


1 


2 


3 


1 


1 


3 


2 


3 


.1 


2 


2 


2 


3 


2 


2 


2 


3 


2 


163 


3 


2 


1 


1 


1 


3 


1 


2 


2 


3 


2 


3 


2 


2 


1 


2 


1 


2 


3 


1 


1 


1 


2 


3 


164 


2 


2 


3 


2 


3 


1 


2 


1 


3 


2 


1 


3 


2 


2 


1 


3 


1 


i 

2 


1 


2 


2 


2 


3 


2 


165 


3 


1 


1 


2 


2 


1 


1 


3 


1 


2 


1 


1 


1 


3 


1 


1 


3 


1 


3 


1 


1 


3 


2 


1 
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2 
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3 
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1 


3 


1 


1 


2 


3 


1 


1 


2 


2 
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3 


2 


1 


3 


2 


1 


2 


167 


1 


1 


1 


2 


1 


1 


3 


1 


3 


1 


3 
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3 


1 


1 


2 


3 


1 


2 


2 


2 


1 


3 


2 


168 


1 


1 


2 


2 


1 


2 


3 


2 


3 


1 


1 


2 


1 


3 
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2 


2 


3 


2 


2 
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1 


1 


3 
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1 
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3 
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2 


2 


2 


3 


2 


2 


3 


2 


3 


2 


2 


1 
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2 


1 


2 


2 


3 


1 


2 


2 


2 


1 


2 


3 


1 


1 


3 


1 


3 


2 


1 


2 


1 


3 


2 


3 


172 


1 


1 


1 


2 


2 


2 


3 


1 


2 


3 


1 


3 


2 


1 


3 


2 


2 


2 


1 


1 


3 


1 


3 


1 


173 
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1 
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1 
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2 
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1 
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3 


1 


1 


3 


1 


1 


1 


.2 


1 


2 


3 


1 


2 


1 


2 


3 


1 


1 


2 


1 


3 


176 


2 


2 


1 


1 


1 


3 


2 


2 


1 


2 


2 


3 


1 


1 


3 


2 


3 


1 


1 


3 


2 


2 


3 


1 


177 


2 


2 


3 


2 


1 


1 


3 


1 


1 


1 


2 


1 


3 


1 


3 


1 


2 


2 


2 


3 


2 


3 


2 


2 


178 


3 


1 


3 


1 


2 


2 


3 


1 


3 


2 


2 


2 


1 


1 


3 


2 


1 


2 


2 


1 


3 


1 


2 


2 


179 


1 


3 


2 


3 


1 


2 


1 


1 


2 


1 


3 


1 


1 


2 


3 


1 


2 


1 


1 


1 


2 


3 


2 


3 


180 


3 


1 


2 


1 


1 


2 


1 


3 


2 


3 


1 


1 


2 


2 


2 


3 


1 


3 


2 


2 


3 


2 


1 


2 


181 


1 


3 


1 


2 


1 


2 


2 


2 


3 


2 


1 


3 


2 


1 


3 


1 


1 


1 


3 


2 


1 


2 


3 


2 


182 


3 


2 


2 


1 


2 


3 


1 


1 


2 


3 


2 


2 


3 


1 


1 


2 


2 


2 


3 


1 


1 


2 


3 


2 


183 


1 


2 


3 


1 


1 


1 


3 


1 


2 


2 


2 


1 


3 


2 


2 


3 


2 


3 


1 


3 


1 


2 


1 


2 


184 
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Table IA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 

Identifier 



1 


1 


1 


2 


1 


3 


1. 


3 


1 


1 


3 


2 


2 


1 


2 


3 


1 


2 


3 


2 


3 


1 


2 


1 


185 


2 


2 


1 


3 


2 


3 


1 


3 


1 


1 


1 


2 


3 


2 


2 


2 


1 


1 


2 


3 


2 


3 


1 


2 


186 


2 


3 


1 


1 


3 


1 


1 


2 


1 


2 


3 


2 


3 


1 


1 


1 


2 


2 


1 


3 


2 


2 


2 


3 


187 


3 


2 


2 


2 


3 


1 


2 


1 


3 


2 


2 


2 


1 


1 


2 


3 


1 


3 


2 


1 


2 


2 


3 


1 


188 


3 


2 


2 


3 


2 


1 


1 


3 


2 


1 


1 


2 


3 


1 


2 


1 


1 


1 


3 


2 


1 


2 


3 


1 


189 


2 


1 


1 


3 


1 


3 


2 


1 


3 


2' 


1 


1 


2 


2 


3 


2 


2 


3 


2 


2 


2 


1 


3 


1 


190 


2 


2 


2 


3 


1 


3 


1 


3 


1 


3 


2 


1 


2 


3 


2 


1 


2 


3 


1 


2 


2 


1 


2 


2 


191 


1 


2 


2 


3 


1 


2 


2 


3 


2 


3 




1 


2 


2 


1 


3 


1 


2 


1 


3 


1 


1 


3 


1 


192 


3 


1 


2 


2 


1 


3 


2 


1 


2 


2 


2 


1 


3 


2 


1 


3 


2 


1 


1 


2 


1 


3 


1 


3 


193 


2 


1 


2 


3 


2 


1 


2 


2 


1 


3 


1 


3 


1 


2 


1 


2 


2 


3 


1 


1 


1 


3 


2 


3 


194 


2 


1 


2 


3 


2 


3 


1 


1 


1 


3 


2 


1 


1 


2 


3 


1 


2 


1 


1 


1 


2 


3 


1 


3 


195 


3 


2 


1 


1 


2 


2 


1 


3 


2 


1 


1 


2 


3 


1 


2 


2 


2 


3 


1 


1 


2 


3 


1 


3 


196 


3 


2 


2 


2 


1 


2 


2 


3 


2 


1 


1 


1 


3 


1 


2 


3 


2 


1 


1 


3 


2 


3 


1 


1 


197 


2 


1 


3 


2 


1 


3 


1 


1 


2 


2 


3 


2 


2 


3 


2 


2' 


1 


1 


1 


3 


1 


1 


2 


3 


198 


2 


1 


2 


2 


3 


2 


2 


1 


3 


2 


2 


1 


2 


3 


2 


1 


3 


2 


3 


2 


3 


2 


1 


1 


199 


3 


1 


3 


2 


3 


1 


1 


1 


3 


2 


2 


1 


2 


1 


2 


3 


1 


1 


1 


3 


2 


1 


2 


1 


200 


1 


2 


1 


2 


1 


3 


1 


1 


3 


2 


2 


3 


1 


2 


3 


1 


3 


2 


2 


2 


1 


2 


3 


1 


201 


2 


2 


2 


1 


3 


1 


1 


3 


2 


1 


1 


3 


1 


1 


2 


1 


1 


3 


2 


3 


1 


3 


2 


1 


202 


2 


3 


2 


3 


2 


1 


2 


1 


1 


3 


1 


2 


1 


2 


2 


2 


3 


2 


1 


1 


3 


1 


1 


3 


203 


2 


1 


3 


1 


1 


3 


1 


3 


2 


2 


3 


2 


1 


2 


2 


3 


2 


2 


1 


2 


1 


1 


3 


2 


204 


3 


2 


3 


2 


2 


1 


2 


2 


1 


3 


2 


2 


2 


1 


1 


3 


2 


2 


1 


3 


1 


3 


2 


1 


205 


1 


1 


2 


1 


2 


1 


3 


2 


3 


1 


2 


3 


2 


3 


1 


1 


1 


2 


2 


3 


1 


1 


2 


3 


206 


2 


2 


1 


3 


1 


3 


1 


1 


2 


1 


3 


1 


3 


2 


3 


1 


2 


2 


1 


2 


1 


3 


2 


2 


207 


3 


1 


1 


3 


2 


3 


1 


3 


2 


2 


1 


1 


2 


3 


1 


2 


2 


2 


3 


2 


1 


1 


1 


2 


208 


1 


1 


2 


3 


2 


1 


1 


1 


3 


2 


1 


1 


1 


3 


1 


1 


1 


3 


2 


3 


1 


2 


3 


1 


209 


3 


2 


2 


1 


3 


2 


2 


1 


2 


3 


1 


2 


3 


1 


1 


2 


1 


2 


2 


3 


2 


3 


2 


1 


210 


1 


1 


1 


2 


3 


1 


3 


2 


2 


1 


3 


1 


3 


2 


1 


3 


1 


1 


2 


2 


1 


2 


3 


2 


211 


3 


1 


2 


1 


2 


1 


3 


1 


1 


3 


1 


2 


2 


1 


3 


2 


2 


1 


3 


2 


3 


1 


2 


1 


212 


1 


2 


1 


3 


2 


2 


2 


3 


2 


2 


3 


1 


3 


1 


2 


2 


2 


1 


2 


3 


1 


3 


2 


1 


213 


2 


1 


3 


1 


1 


2 


1 


3 


2 


2 


1 


3 


2 


1 


3 


2 


1 


1 


3 


1 


3 


2 


1 


2 


214 


3 


1 


1 


2 


2 


2 


3 


2 


1 


2 


2 


3 


2 


3 


1 


1 


3 


2 


2 


2 


1 


3 


2 


1 


215 


3 


2 


1 


3 


2 


1 


1 


3 


1 


1 


3 


1 


3 


1 


1 


2 


2 


1 


3 


1 


2 


2 


1 


1 


216 
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2 


.3 


2 


3 


2 
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1 


2 


3 


2 


1 


2 


3 


2 


1 


1 


1 


2 


1 


3 


2 


3 


217 


3 


1 


1 


2 


2 


1 


3 


2 


2 


1 


3 


1 


3 


2 


1 


1 


1 


2 


2 


3 


2 


2 


2 


3 


218 


3 


1 


1 


1 


2 


2 


3 


1 


1 


3 


1 


2 


1 


3 


2 


1 


1 


3 


1 


1 


1 


2 


3 


1 


219 


3 


2 


3 


2 


1 


2 


2 


1 


2 


3 


2 


3 


1 


2 


2 


2 


1 


2 


3 


1 
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1 


3 


1 


220 


2 
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1 


3 
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2 


2 


3 
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1 
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3 


221 


2 , 


1 


2 


3 


2 


1 


2 


2 


3 


2 


1 


2 


2 


3 


1 


3 


2 


1 


3 


1 


2 


3 


1 


1 


222 


3 


2 


3 


1 


2 


2 


3 


1 


1 


2 


1 


3 


2 


1 


3 


1 


2 


2 


3 


2 


2 


2 


1 


1 


223 


1 


3 


2 


1 


1 


3 


2 


2 


3 


2 


2 


2 


3 


1 


2 


2 


3 


1 


1 


1 


2 


2 


2 


3 


224 


3 


1 


1 


3 


2 


2 


2 


3 


1 


2 


2 


2 


1 


1 


3 


2 


2 


2 


1 


1 


3 


1 


1 


3 


225 


3 


1 


3 


1. 


1 


3 


1 


2 


1 


1 


1 


2 


3 


1 


2 


1 


2 


2 


3 


2 


2 


1 


2 


3 


226 


1 


2 


3 


1 


2 


3 


1 


3 


2 


2 


3 


2 


2 


1 


1 


2 


1 


3 


2 


2 


1 


3 


2 


2 


227 


2 


1 


2 


3 


1 


2 


1 


2 


2 


2 


3 


1 


1 


3 


1 


3 


2 


3 


2 


2 


1 


1 


3 


1 


228 


3 


1 


3 


1 


2 


3 


1 


2 


2 


1 


1 


1 


3 


2 


3 


1 


2 


2 


2 


1 


2 


3 


1 


1 


229 


1 


2 


1 


3 


2 


2 


1 


1 


3 


1 


3 


2 


3 


1 


2 


3 


1 


3 


1 


1 


2 


1 


1 


1 


230 


2 


2 


2 


1 


2 


2 


3 


2 


2 


1 


3 


1 


2 


1 


1 


1 


3 


1 


3 


2 


2 


3 


1 


3 


231 


1 


3 


1 


1 


2 


1 


2 


2 


3 


1 


2 


1 


3 


2 


2 


3* 


1 


1 


3 


2 


2 


3 


1 


1 


232 


2 


1 


3 


2 


3 


2 


1 


1 


1 


3 


2 


3 


2 


1 


3 


1 


2 


2 


3 


2 


1 


1 


1 


2 


233 


1 


3 


2 


1 


3 


2 


3 


1 


2 


1 


2 


3 


1 


2 


2 


2 


3 


1 


1 


2 


1 


2 


2 


3 


234 


2 


3 


2 


1 


2 


2 


3 


1 


1 


2 


2 


1 


3 


1 


1 


2 


1 


3 


2 


3 


1 


3 


1 


1 


235 


2 


3 


1 


2 


1 


2 


3 


1 


3 


1 


2 


1 


3 


1 


1 


3 


2 


2 


2 


1 


1 


2 


3 


2 


236 


3 


1 


1 


3 


1 


1 


3 


2 


1 


1 


3 


2 


1 


2 


1 


1 


1 


3 


2 


1 


1 


1 


2 


3 


237 


2 


2 


2 


1 


1 


3 


2 


3 


2 


3 


1 


2 


1 


1 


3 


1 


1 


1 


3 


1 


2 


1 


3 


1 


238 


2 


1 


2 


2 


3 


2 


2 


3 


1 


1 


2 


3 


2 


3 


2 


2 


2 


1 


1 


1 


3 


1 


3 


1 


239 


3 


1 


1 


2 


1 


1 


2 


3 


1 


2 


3 


1 


3 


1 


2 


3 


1 


2 


2 


1 


2 


2 


3 


1 


240 
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Table IA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 

Identifier 



Z 


J. 


3 


X 


"3 

3 


X 


X 


X 


«a 
3 


X 


•a 
3 


X 


•a 
3 


X 


X 


2 


2 


3 


2 


X 


2 


»3 

2 


X 


X 


24X 


X 


2 


-> 
3 


9 
2 


X 


9 

2 


-1 
X 


X 


2 


«3 

3 


X 


•a 
3 


X 


2 


X 


2 


■3 

3 


2 


*3 

2 


2 


3 


2 


■3 

3 


X 


242 


X 


1 

JL 


9 
2 


1 


-3 

3 


1 
X 


9 
2 


X 


X 


•3 

3 


X 


2 


2 


•3 
3 


X 


2 


2 


3 


z 


«3 

3 


2 


2 


2 


•3 

3 


243 


9 
Z 


2 


9 

2 


•a 
3 


X 


9 

2 


3 


X 


2 


X 


X 


2 


X 


■3 
3 


X 


X 


3 


1 


■3 

3 


1 


1 


2 


3 


1 


244 


X 


•a 

•5 


X 


9 
Z 


■3 

3 


X 


X 


9 

2 


X 


X 


•a 
3 


•3 
2 


2 


•3 

3 


2 


*3 

3 


X 


1 


*3 

2 


«3 

3 


2 


2 


"3 

2 


X 


245 


X 


3 


X 


Z 


•a 
3 


X 


X 


X 


•a 
3 


X 


X 


X 


3 


2 


•3 
3 


2 


1 


3 


1 


1 


2 


1 


*3 

2 


2 


246 


2 


•3 


2 


2 


X 


X 


X 


2 


•3 
3 


2 


X 


2 


3 


2 


1 


3 


2 


1 


1 


2 


2 


3 


1 


3 


247 


2 


-1 
1 


3 


2 


X 


•3 

3 


2 


»3 

3 


2 


3 


1 


1 


3 


2 


2 


1 


2 


2 


2 


3 


2 


2 


1 


2 


248 


1 


3 


0 
2 


3 


X 


1 


2 


3 


2 


2 


2 


3 


2 


1 


1 


1 


3 


1 


3 


2 


2 


2 


1 


1 


249 


J 


1 


2 


1 


1 


1 


2 


3 


1 


3 


1 


1 


2 


2 


3 


1 


3 


2 


1 


1 


2 


2 


3 


2 


250 


•"> 

2 




X 


2 


3 


1 


3 


1 


1 


1 


2 


2 


3 


2 


2 


2 


1 


1 


3 


2 


3 


2 


2 


2 


251 


1 


1 


1 


2 


1 


1 


3 


2 


1 


3 


2 


3 


2 


3 


1 


3 


2 


1 


1 


2 


1 


3 


2 


1 


252 


2 


1 


2 


3 


1 


1 


1 


2 


1 


2 


3 


2 


3 


1 


2 


1 


3 


2 


1 


1 


3 


1 


3 


1 


253 


1 


2 


2 


3 


2 


1 


1 


3 


1 


3 


2 


3 


1 


2 


2 


1 


2 


1 


3 


1 


2 


3 


1 


2 


254 


1 


3 


1 


3 


2 


1 


1 


•3 
3 


1 


1 


2 


3 


1 


1 


1 


3 


1 


3 


1 


2 


1 


1 


2 


1 


255 


*3 

2 


1 


X 


3 


2 


1 


X 


«3 

3 


2 


1 


3 


1. 


2 


3 


2 


2 


1 


1 


1 


3 


1 


3 


1 


2 


256 


1 


1 


1 


2 


1 


3 


1 


1 


1 


3 


1 


1 


2 


2 


3 


2 


1 


3 


1 


3 


2 


1 


3 


2 


257 


1 


2 


1 


3 


1 


2 


2 


2 


1 


1 


3 


2 


3 
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Table XA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 
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Table XA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 
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Table IA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 
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Table IA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 
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Table IA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 
Identifier 
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Table XA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 
Identifier 
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Table IA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 

Identifier 
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Table IA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 

Identifier 
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Table XA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 

Identifier 
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Table IA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 
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Table IA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 
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Table IA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 
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Table IA: Numeric sequences corresponding to nucleotide 
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Table XA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 

Identifier 



1 


3 


2 


2 


2 


3 


1 


1 


1 


2 


2 


3 


2 


1 


1 


3 


2 


2 


2 


3 


1 


2 


3 


1 


1025 


2 


1 


3 


1 


1 


2 


2 


3 


1 


2 


2 


1 


1 


2 


3 


1 


2 


3 


1 


3 


2 


1 


3 


2 


1026 


1 


3 


1 


3 


1 


2 


2 


2 


3 


2 


1 


1 


2 


1 


1 


3 


2 


1 


2 


2 


3 


1 


1 


3 


' 1027 


1 


2 


1 


1 


2 


3 


1 


2 


3 


2 


1 


1 


2 


3 


2 


1 


1 


3 


2 


1 


3 


2 


3 


2 


1028 


2 


3 


1 


1 


1 


2 


2 


2 


3 


1 


2 


3 


1 


3 


1 


3 


1 


2 


1 


2 


3 


2 


2 


1 


1029 


2 


3 


2 


3 


2 


1 


1 


1 


3 


2 


1 


2 


1 


3 


2 


2 


2 


1 


2 


3 


2 


2 


1 


3 


1030 


2 


3 


1 


1 


2 


1 


1 


3 


2 


3 


1 


1 


1 


2 


1 


3 


1 


1 


2 


3 


1 


1 


2 


3 


1031 


1 


1 


1 


3 


1 


1 


1 


3 


1 


2 


2 


3 


2 


1 


1 


2 


1 


1 


3 


2 


1 


3 


1 


3 


1032 


1 


1 


2 


3 


1 


1 


1 


2 


1 


3 


2 


3 


2 


2 


1 


1 


1 


2 


3 


1 


3 


2 


3 


2 


1033 


3 


2 


1 


3 


1 


2 


1 


1 


1 


3 


1 


2 


3 


2 


3 


1 


1 


2 


2 


1 


2 


3 


1 


2 


1034 


3 


1 


2 


1 


3 


2 


1 


2 


1 


2 


3 


2 


3 


2 


3 


2 


1 


2 


2 


2 


3 


2 


2 


2 


1035 


1 


2 


3 


2 


2 


2 


3 


2 


1 


3 


1 


1 


1 


2 


3 


2 


2 


2 


3 


1 


1 


3 


1 


2 


1036 


1 


1 


1 


2 


2 


2 


3 


2 


1 


3 


1 


3 


1 


3 


1 


1 


1 


2 


2 


2 


3 


2 


2 


3 


1037 


2 


1 


3 


1 


1 


2 


1 


1 


3 


1 


2 


2 


1 


3 


2 


1 


1 


3 


2 


3 


2 


1 


3 


1 


1038 


2 


3 


1 


2 


2 


2 


1 


3 


1 


3 


1 


1 


1 


2 


1 


2 


3 


1 


3 


2 


1 


3 


1 


1 


1039 


1 


1 


2 


1 


3 


1 


3 


2 


1 


2 


3 


2 


2 


3 


2 


2 


2 


1 


2 


3 


1 


3 


1 


1 


1040 


3 


1 


2 


3 


1 


2 


3 


1 


1 


3 


1 


3 


2 


2 


2 


1 


2 


2 


3 


2 


1 


1 


1 


2 


1041 


1 


1 


3 


2 


1 


1 


1 


3 


1 


1 


3 


1 


1 


3 


1 


1 


1 


2 


3 


2 


3 


2 


2 


1 


1042 


2 


2 


3 


1 


1 


3 


1 


1 


2 


2 


1 


1 


3 


2 


3 


2 


2 


2 


1 


3 


2 


3 


2 


1 


1043 


1 


3 


1 


1 


1 


3 


1 


1 


2 


3 


2 


2 


3 


1 


2 


2 


2 


1 


2 


3 


1 


2 


3 


2 


1044 


3 


1 


2 


2 


1 


1 


1 


3 


1 


3 


1 


2 


3 


2 


2 


3 


1 


2 


2 


3 


1 


1 


1 


2 


1045 


1 


1 


2 


3 


1 


2 


1 


1 


2 


2 


3 


2 


2 


3 


1 


3 


1 


3 


1 


3 


2 


1 


1 


2 


1046 


3 


2 


2 


2 


3 


2 


2 


3 


1 


1 


1 


3 


2 


3 


2 


1 


1 


1 


3 


2 


1 


2 


1 


2 


' 1047 


2 


3 


1 


3 


2 


2 


1 


2 


1 


2 


3 


1 


3 


1 


1 


1 


3 


2 


3 


2 


1 


1 


2 


2 


1048 


2 


2 


3 


2 


3 


1 


3 


1 


1 


1 


3 


1 


1 


3 


2 


1 


2 


1 


2 


1 


3 


1 


1 


2 


1049 


3 


2 


1 


1 


3 


2 


2 


2 


1 


3 


1 


3 


2 


2 


1 


2 


1 


3 


1 


3 


2 


2 


2 


1 


' 1050 


3 


1 


2 


1 


3 


1 


2 


1 


3 


1 


2 


1 


•1 


3 


2 


2 


1 


1 


2 


2 


3 


1 


1 


3 


1051 


1 


3 


1 


3 


1 


2 


3 


1 


2 


2 


3 


2 


2 


2 


1 


2 


3 


2 


1 


2 


2 


1 


2 


3 


1052 


1 


1 


1 


3 


2 


2 


1 


1 


3 


1 


1 


1 


2 


2 


3 


2 


1 


3 


2 


3 


1 


2 


1 


3 


1053 


2 


2 


2 


3 


1 


2 


1 


2 


2 


3 


2 


2 


2 


3 


2 


3 


1 


3 


2 


3 


2 


1 


2 


1 


1054 


1 


2 


2 


2 


3 


2 


1 


3 


1 


1 


1 


3 


2 


2 


3 


2 


2 


1 


2 


3 


1 


3 


2 


2 


1055 


3 


1 


2 


2 


2 


3 


1 


3 


2 


1 


1 


3 


2 


2 


2 


1 


2 


1 


3 


1 


2 


3 


1 


1 


1056 


1 


1 


3 


1 


2 


1 


1 


1 


3 


2 


3 


1 


3 


2 


2 


3 


1 


2 


2 


2 


1 


3 


1 


2 


1057 


3 


1 


2 


1 


2 


2 


3 


2 


1 


1 


3 


1 


2 


1 


2 


3 


2 


2 


3 


2 


1 


1 


1 


3 


1058 


3 


2 


1 


1 


3 


1 


3 


2 


3 


2 


1 


2 


2 


3 


2 


1 


1 


3 


2 


2 


1 


1 


2 


2 


1059 


3 


2 


3 


2 


3 


1 


2 


2 


1 


3 


2 


1 


1 


2 


3 


1 


1 


3 


2 


1 


2 


2 


2 


1 


1060 


3 


2 


1 


1 


3 


1 


1 


1 


3 


1 


2 


2 


1 


1 


3 


2 


3 


2 


2 


1 


3 


2 


1 


1 


1061 


1 


3 


2 


1 


3 


1 


1 


1 


3 


2 


2 


3 


1 


1 


1 


2 


2 


3 


1 


2 


2 


1 


2 


3 


1062 


2 


1 


1 


3 


1 


3 


1 


1 


3 


2 


2 


3 


1 


3 


2 


1 


1 


2 


3 


2 


1 


2 


2 


2 


1063 


3 


2 


2 


1 


1 


3 


1 


1 


1 


2 


1 


3 


2 


1 


3 


1 


2 


1 


1 


3 


2 


3 


1 


1 


1064 


2 


1 


1 


3 


2 


1 


1 


1 


2 


2 


3 


1 


1 


1 


3 


2 


3 


2 


1 


2 


1 


3 


2 


3 


1065 


1 


1 


3 


1 


2 


3 


2 


1 


2 


3 


2 


2 


2 


1 


2 


2 


3 


2 


2 


3 


2 


3 


2 


1 


1066 


1 


2 


2 


2 


1 


3 


1 


1 


2 


1 


2 


1 


3 


2 


3 


1 


1 


3 


1 


3 


1 


2 


1 


3 


1067 


3 


2 


2 


1 


2 


3 


1 


1 


1 


3 


1 


3 


2 


1 


2 


3 


2 


3 


2 


2 


1 


1 


1 


2 


1068 


2 


1 


2 


2 


1 


2 


3 


2 


3 


1 


1 


3 


1 


1 


3 


1 


1 


2 


3 


1 


2 


2 


1 


3 


1069 


2 


1 


1 


2 


1 


1 


3 


2 


2 


3 


1 


1 


3 


1 


3 


1 


1 


2 


2 


3 


2 


2 


3 


2 


1070 


2 


3 


1 


2 


3 


2 


2 


2 


3 


1 


2 


3 


2 


1 


1 


2 


2 


3 


2 


2 


1 


1 


1 


3 


1071 


3 


2 


3 


1 


1 


1 


3 


1 


2 


2 


2 


3 


1 


3 


2 


2 


2 


3 


2 


1 


2 


1 


1 


2 


1072 


1 


3 


1 


3 


1 


1 


2 


1 


2 


1 


3 


1 


2 


2 


3 


1 


3 


1 


2 


2 


2 


3 


2 


2 


1073 


2 


2 


2 


3 


1 


3 


1 


2 


3 


2 


3 


1 


2 


3 


1 


2 


1 


1 


1 


3 


2 


2 


1 


1 


1074 


3 


2 


2 


3 


2 


1 


1 


1 


2 


2 


3 


2 


1 


3 


2 


1 


1 


1 


3 


1 


1 


3 


2 


1 


1075 


3 


2 


3 


2 


2 


1 


2 


3 


1 


2 


3 


2 


2 


3 


2 


2 


2 


3 


2 


1 


2 


2 


1 


2 


1076 


1 


2 


2 


1 


2 


2 


3 


2 


3 


2 


1 


3 


1 


2 


3 


2 


1 


2 


2 


1 


1 


3 


1 


3 


1077 


3 


2 


2 


1 


3 


1 


1 


1 


3 


1 


2 


2 


2 


1 


3 


1 


1 


3 


2 


2 


1 


3 


2 


2 


1078 


2 


2 


3 


2 


3 


2 


1 


2 


2 


1 


1 


3 


1 


3 


1 


3 


2 


3 


1 


1 


1 


2 


1 


2 


1079 


3 


2 


2 


2 


1 


1 


3 


1 


2 


1 


3 


1 


1 


1 


3 


1 


3 


2 


3. 


1 


2 


2 


2 


1 


1080 



WO 02/059355 



PCT/CA02/00089 



- 25 - 

Table IA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence 

Identifier 



1 


1 


2 


3 


r 


3 


1 


1 


1 


2 


1 


3 


1 


2 


1 


3 


2 


2 


1 


2 


2 


3 


2 


3 


1081 


2 


3 


1 


1 


2 


2 


3 


1 


1 


2 


1 


1 


3 


1 


1 


2 


2 


2 


3 


2 


2 


3 


2 


3 


1082 


1 


1 


2 


1 


1 


3 


1 


2 


2 


3 


1 


1 


2 


2 


1 


3 


2 


3 


1 


3 


2 


1 


1 


3 


1083 


1 


1 


2 


3 


2 


2 


2 


3 


1 


3 


1 


3 


1 


2 


2 


2 


1 


3 


2 


1 


1 


1 


3 


1 


1084 


1 


3 


2 


2 


2 


1 


3 


1 


1 


2 


1 


3 


1 


1 


1 


2 


3 


2 


3 


2 


2 


2 


3 


1 


1085 


2 


1 


2 


1 


1 


3 


2 


1 


1 


3 


2 


3 


2 


2 


1 


1 


3 


1 


2 


2 


2 


3 


.1 


3 


1086 


3 


2 


1 


3 


2 


3 


1 


1 


2 


1 


1 


3 


2 


2 


1 


3 


2 


3 


2 


2 


1 


1 


2 


1 


1087 


1 


1 


3 


2 


3 


2 


3 


2 


2 


1 


1 


1 


3 


2 


1 


1 


1 


2 


3 


2 


1 


3 


1 


2 


1088 


1 


3 


1 


3 


1 


2 


3 


2 


2 


2 


1 


2 


3 


2 


2 


3 


2 


3 


1 


1 


2 


2 


1 


1 


1089 


1 


3 


2 


2 


3 


1 


1 


2 


1 


2 


2 


3 


1 


2 


3 


1 


2 


1 


1 


3 


1 


1 


3 


1 


1090 


2 


3 


1 


1 


2 


3 


2 


3 


1 


3 


1 


2 


3 


2 


2 


2 


1 


3 


1 


1 


2 


1 


1 


2 


1091 


1 


1 


2 


1 


1 


2 


3 


1 


2 


3 


2 


1 


1 


3 


2 


2 


2 


3 


1 


3 


2 


2 


2 


3 


1092 


1 


1 


1 


3 


X 


3 


2 


3 


1 


1 


2 


1 


3 


1 


1 


1 


2 


1 


1 


3 


1 


3 


1 


1 


1093 


1 


1 


2 


1 


1 


1 


3 


2 


2 


1 


2 


2 


3 


1 


3 


1 


3 


1 


3 


2 


2 


2 


1 


3 


1094 


1 


3 


2 


1 


3 


2 


3 


2 


2 


3 


2 


1 


3 


2 


2 


2 


1 


3 


2 


1 


2 


1 


2 


1 


1095 


3 


2 


1 


1 


3 


1 


1 


2 


3 


2 


1 


2 


2 


1 


3 


1 


2 


1 


2 


2 


2 


3 


2 


3 


1096 


3 


1 


2 


1 


1 


1 


2 


3 


2 


2 


2 


3 


1 


2 


1 


1 


1 


3 


2 


1 


3 


2 


2 


3 


1097 


1 


2 


1 


3 


2 


1 


2 


3 


2 


1 


2 


3 


2 


3 


2 


3 


1 


1 


3 


1 


2 


2 


2 


1 


1098 


1 


2 


3 


1 


1 


2 


3 


2 


1 


3 


1 


3 


2 


3 


1 


2 


2 


1 


3 


2 


2 


2 


1 


1 


1099 


3 


2 


1 


3 


2 


1 


2 


2 


2 


1 


3 


2 


3 


1 


2 


3 


2 


1 


1 


3 


1 


1 


2 


1 


1100 


1 


3 


1 


1 


2 


2 


3 


2 


1 


2 


2 


3 


1 


1 


3 


1 


1 


3 


1 


1 


2 


1 


2 


3 


1101 


2 


2 


2 


1 


2 


1 


3 


1 


1 


2 


2 


3 


1 


3 


1 


3 


1 


1 


3 


2 


2 


1 


1 


3 


1102 


1 


1 


1 


3 


2 


1 


3 


2 


1 


3 


1 


3 


1 


2 


2 


2 


3 


1 


3 


1 


1 


2 


2 


1 


1103 


2 


2 


2 


1 


1 


1 


3 


1 


1 


1 


3 


2 


1 


2 


2 


3 


2 


1 


1 


3 


1 


3 


2 


3 


1104 


1 


1 


1 


2 


2 


3 


1 


3 


1 


1 


1 


3 


2 


3 


1 


1 


2 


3 


1 


1 


3 


2 


2 


2 


1105 


1 


1 


3 


1 


1 


1 


2 


1 


1 


3 


2 


1 


2 


3 


1 


2 


1 


3 


2 


1 


3 


2 


1 


3 


1106 


1 


2 


2 


2 


3 


1 


1 


2 


2 


3 


2 


1 


2 


2 


3 


2 


1 


3 


2 


2 


2 


3 


2 


3 


1107 


1 


1 


3 


1 


3 


1 


1 


2 


1 


1 


2 


3 


2 


1 


3 


1 


3 


1 


2 


1 


2 


1 


1 


3 


1108 


2 


3 


2 


3 


2 


1 


1 


2 


1 


3 


2 


2 


3 


2 


2 


1 


1 


2 


3 


1 


3 


2 


1 


1 


1109 


2 


1 


2 


1 


3 


2 


2 


3 


2 


1 


3 


2 


2 


2 


1 


3 


1 


2 


3 


1 


1 


2 


3 


2 


1110 


1 


2 


2 


3 


2 


3 


2 


2 


1 


3 


1 


1 


2 


3 


1 


2 


3 


2 


2 


1 


1 


2 


1 


3 


1111 


3 


2 


2 


2 


3 


2. 


1 


2 


1 


3 


2 


1 


2 


2 


2 


3 


1 


2 


2 


3 


1 


2 


3 


2 


1112 


1 


3 


1 


3 


2 


1 


1 


1 


3 


2 


1 


2 


3 


1 


3 


2 


2 


1 


2 


3 


1 


1 


2 


1 


1113 


3 


1 


1 


1 


3 


2 


2 


2 


1 


1 


3 


2 


3 


1 


2 


3 


2 


1 


2 


1 


2 


2 


3 


2 


1114 


2 


2 


1 


1 


1 


2 


3 


1 


2 


1 


1 


1 


3 


1 


3 


2 


1 


3 


2 


3 


1 


1 


3 


2 


1115 


2 


2 


1 


1 


1 


2 


3 


2 


3 


2 


3 


1 


3 


1 


1 


3 


1 


2 


3 


1 


1 


2 


1 


1 


1116 


1 


2 


2 


2 


3 


2 


1 


2 


1 


1 


1 


3 


2 


3 


1 


1 


3 


1 


1 


3 


1 


3 


1 


1 


1117 


2 


3 


1 


2 


2 


1 


3 


2 


1 


2 


2 


2 


3 


2 


3 


1 


1 


3 


1 


3 


1 


2 


2 


2 


1118 


2 


2 


2 


3 


1 


1 


2 


3 


1 


1 


1 


2 


2 


3 


1 


2 


3 


1 


2 


1 


3 


1 


2 


3 


1119 


1 


3 


1 


3 


2 


1 


1 


3 


1 


2 


2 


1 


1 


3 


1 


1 


2 


1 


1 


3 


1 


1 


1 


3 


1120 


1 


2 


2 


3 


1 


1 


2 


2 


3 


1 


3 


1 


1 


3 


2 


3 


1 


1 


3 


2 


1 


1 


1 


2 


1121 


2 


2 


2 


1 


3 


1 


3 


1 


1 


3 


2 


1 


2 


2 


3 


2 


2 


2 


3 


1 


1 


1 


3 


1 


1122 


2 


1 


1 


1 


3 


2 


3 


1 


1 


1 


3 


1 


2 


2 


2 


3 


1 


1 


1 


2 


3 


1 


2 


3 


1123 


3 


1 


1 


1 


3 


2 


2 


1 


3 


1 


3 


1 


1 


1 


2 


3 


2 


1 


3 


1 


1 


1 


2 


2 


1124 


3 


2 


3 


1 


1 


2 


1 


1 


2 


3 


1 


1 


3 


1 


1 


3 


2 


2 


1 


2 


3 


2 


2 


1 


1125 


2 


2 


3 


2 


3 


1 


1 


2 


1 


1 


1 


3 


2 


1 


3 


1 


2 


3 


2 


3 


2 


2 


1 


2 


1126 


2 


2 


1 


2 


1 


2 


3 


1 


2 


1 


2 


3 


1 


3 


2 


'2 


2 


3 


2 


3 


2 


2 


3 


1 


1127 


2 


2 


3 


1 


2 


2 


2 


3 


2 


3 


2 


3 


1 


3 


2 


1 


2 


2 


1 


3 


2 


2 


1 


2 


1128 


1 


1 


1 


3 


2 


3 


1 


2 


2 


1 


1 


3 


2 


2 


1 


3 


2 


2 


2 


3 


1 


3 


1 


2 


1129 


2 


2 


3 


2 


1 


2 


2 


2 


3 


2 


1 


2 


1 


1 


2 


3 


2 


2 


3 


1 


1 


3 


1 


3 


1130 


3 


2 


2 


2 


3 


1 


1 


1 


2 


2 


1 


3 


2 


3 


2 


3 


1 


3 


1 


1 


1 


2 


1 


2 


1131 


1 


1 


2 


3 


2 


2 


3 


1 


3 


1 


2 


2 


3 


1 


2 


1 


1 


2 


3 


2 


2 


3 


1 


1 


1132 


2 


1 


3 


2 


1 


3 


2 


1 


3 


2 


1 


2 


2 


3 


2 


2 


3 


2 


1 


1 


2 


1 


1 


3 


1133 


3 


2 


2 


3 


2 


1 


1 


2 


2 


2 


3 


1 


3 


2 


3 


2 


2 


1 


3 


2 


2 


1 


2 


2 


1134 


2 


3 


1 


1 


2 


1 


2 


3 


1 


2 


1 


3 


2 


2 


1 


3 


2 


1 


1 


2 


2 


3 


2 


3 


1135 


2 


3 


1 


2 


1 


3 


2 


1 


2 


3 


2 


2 


2 


3 


2 


3 


1' 


2 


2 


1 


1 


1 


3 


1 


1136 



WO 02/059355 



PCT/CA02/00089 



- 26 - 

Table IA: Numeric sequences corresponding to nucleotide 
base patterns of a set of oligonucleotides 

Numeric Pattern Sequence* 

Identifier 



3 


1 


2 


3 


2 


l 


2 


1 


1 


1 


3 


1 


3 


2 


1 


2 


3 


2 


2 


1 


2 


1 


1 


3 


1137 


1 


3 


2 


3 


1 


3 


1 


2 


2 


2 


1 


3 


1 


1 


3 


1 


2 


3 


2 


2 


1 


2 


2 


1 


1138 


1 


.2 


3 


1 


3 


1 


1 


2 


2 


2 


3 


2 


2 


1 


1 


1 


3 


1 


3 


1 


1 


1 


3 


2 


• 1139 


1 


1 


1 


3 


1 


1 


2 


2 


1 


3 


2 


1 


2 


3 


1 


2 


1 


3 


1 


2 


3 


1 


3 


1 


1140 


2 


1 


3 


1 


3 


2 


2 


3 


2 


1 


2 


1 


3 


2 


2 


2 


1 


2 


1 


3 


2 


2 


3 


1 


1141 


3 


2 


1 


3 


1 


1 


2 


3 


1 


2 


2 


3 


2 


2 


2 


1 


3 


1 


1 


3 


1 


2 


2 


2 


1142 


3 


2 


2 


2 


1 


2 


3 


2 


2 


2 


3 


1 


3 


1 


1 


3 


1 


3 


2 


2 


1 


2 


2 


2 


1143 


2 


1 


3 


1 


1 


3 


2 


2 


2 


3 


1 


1 


1 


3 


2 


2 


1 


2 


2 


3 


1 


2 


2 


3 


1144 


3 


1 


2 


3 


1 


1 


3 


1 


3 


2 


1 


2 


2 


2 


3 


2 


2 


1 


2 


1 


2 


3 


2 


1 


1145 


3 


1 


2 


3 


1 


1 


2 


1 


2 


1 


3 


2 


1 


1 


3 


2 


1 


2 


2 


3 


1 


3 


2 


1 


1146 


2 


1 


3 


2 


3 


1 


2 


3 


1 


1 


1 


2 


2 


2 


3 


1 


3 


1 


2 


1 


3 


1 


2 


1 


1147 


3 


1 


1 


1 


3 


1 


1 


1 


2 


2 


3 


1 


1 


3 


1 


3 


2 


2 


2 


3 


1 


2 


1 


2 


1148 


1 


2 


2 


2 


3 


1 


3 


2 


1 


2 


2 


2 


3 


2 


3 


2 


1 


2 


2 


3 


1 


1 


2 


3 


1149 


1 


2 


3 


1 


3 


2 


2 


3 


1 


1 


1 


2 


2 


2 


3 


1 


1 


3 


2 


1 


2 


2 


3 


2 


1150 


2 


2 


1 


1 


2 


1 


3 


2 


3 


1 


3 


1 


3 


1 


3 


2 


1 


2 


1 


2 


3 


2 


1 


1 


1151 


1 


2 


2 


1 


1 


3 


1 


3 


1 


3 


2 


3 


1 


3 


2 


1 


1 


1 


2 


3 


2 


1 


1 


1 


1152 


1 


1 


3 


1 


1 


2 


1 


3 


1 


2 


3 


1 


3 


1 


2 


2 


1 


3 


1 


1 


1 


2 


1 


3 


1153 


1 


3 


2 


2 


2 


1 


1 


1 


3 


1 


3 


2 


2 


1 


3 


1 


1 


2 


2 


3 


1 


1 


1 


3 


1154 


3 


2 


1 


1 


3 


1 


2 


2 


2 


3 


2 


2 


3 


1 


1 


2 


1 


1 


1 


3 


1 


1 


3 


1 


1155 


1 


3 


1 


3 


1 


1 


1 


3 


1 


1 


3 


2 


2 


1 


1 


1 


3 


2 


3 


1 


2 


1 


2 


2 


1156 


2 


1 


1 


2 


1 


3 


1 


3 


1 


1 


3 


1 


3 


1 


2 


3 


2 


1 


2 


3 


1 


1 


2 


1 


1157 


2 


2 


1 


2 


2 


1 


3 


2 


3 


1 


2 


1 


1 


3 


2 


3 


1 


1 


3 


2 


2 


2 


1 


3 


1158 


1 


2 


1 


1 


2 


3 


2 


1 


1 


1 


3 


1 


2 


3 


1 


3 


2 


2 


2 


1 


2 


3 


1 


3 


1159 


2 


2 


3 


1 


2 


2 


2 


3 


1 


3 


1 


3 


2 


2 


3 


1 


2 


1 


1 


3 


1 


2 


2 


2 


1160 


1 


2 


3 


1 


2 


2 


1 


2 


2 


3 


2 


3 


2 


3 


2 


1 


3 


1 


1 


2 


2 


1 


3 


1 


1161 


2 


1 


2 


1 


1 


1 


3 


1 


2 


1 


2 


1 


3 


2 


1 


3 


1 


2 


3 


1 


2 


3 


2 


3 


1162 


2 


2 


2 


1 


3 


2 


2 


3 


1 


3 


1 


2 


3 


1 


1 


3 


2 


2 


1 


2 


2 


1 


3 


1 


1163 


1 


2 


2 


3 


1 


1 


2 


2 


3 


1 


2 


1 


2 


1 


3 


2 


3 


2 


1 


1 


1 


3 


2 


3 


1164 


3 


1 


1 


3 


1 


1 


1 


3 


1 


2 


2 


1 


2 


2 


3 


2 


1 


2 


2 


3 


1 


3 


2 


2 


1165 


1 


2 


2 


3 


r 


3 


2 


3 


2 


1 


3 


2 


3 


1 


2 


2 


2 


1 


3 


1 


1 


1 


2 


1 


1166 


1 


1 


2 


1 


i 


1 


3 


2 


3 


2 


2 


2 


1 


1 


3 


1 


3 


2 


1 


3 


1 


3 


2 


1 


1167 


3 


2 


1 


3 


i 


3 


1 


2 


1 


1 


2 


2 


3 


1 


2 


3 


2 


3 


2 


1 


1 


2 


2 


2 


1168 



In Table IA, each of the numerals 1 to 3 {numeric identifiers) 
represents a nucleotide base and the pattern of numerals 1 to 3 of the 
sequences in the above list corresponds to the pattern of nucleotide 
5 bases present in the oligonucleotides of Table I, which oligonucleotides 
have been found to be non- cross -hybridizing, as described further in the 
detailed examples. Each nucleotide base is selected from the group of 
nucleotide bases consisting of A, C f G, and T/U. A particularly 
preferred embodiment of the invention, in which a specific base is 
10 assigned to each numeric identifier is shown in Table I, below. 

In one broad aspect, the invention is a composition comprising 
molecules for use as tags or tag complements wherein each molecule 
comprises an oligonucleotide selected from a set of oligonucleotides 
15 based on a group of sequences as specified by numeric identifiers set 

out in Table IA. In the sequences, each of 1 to 3 is a nucleotide base 
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selected to be different from the others of 1 to 3 with the proviso that 
up to three nucleotide bases of each sequence can be substituted with 
any nucleotide base provided that: 

for any pair of sequences of the set: 

Ml <; 15, M2 s 12, M3 £ 19, M4 <; 15, and M5 * 18, where: 
Ml is the maximum number of matches for any alignment in which there 
are no internal indels; 

M2 is the maximum length of a block of matches for any alignment; 

M3 is the maximum number of matches for any alignment having a maximum 

score; 

M4 is the maximum sum of the lengths of the longest two blocks of 
matches for any alignment of maximum score; and 

M5 is the maximum sum of the lengths of all the blocks of matches 
having a length of at least 3, for any alignment of maximum score; 
wherein: 

the score of an alignment is determined according to the equation (A 
x m) (B x mm) - (C x (og + eg) ) - (D x eg) ) , wherein: 
for each of (i) to (iv) : 

(i) m=6,mm=6 f og=0 and eg = 6, 

(ii) m=6, mm = 6, og = 5 and eg = 1, 

(iii) m = 6, mm = 2, og = 5 and eg = 1, and 

(iv) m = S , mm = 6 , og = 6 and eg = 0, 

A is the total number of matched pairs of bases in the alignment; 
B is the total number of internal mismatched pairs in the 
alignment; 

C is the total number of internal gaps in the alignment; and 
D is the total number of internal indels in the alignment minus 
the total number of internal gaps in the alignment; and 
wherein the maximum score is determined separately for each of (i) , 
(ii), (iii) and (iv). 
An explanation of the meaning of the parameters set out above is 
5 given in the section describing detailed embodiments. 

In another broad aspect, the invention is a composition containing 
molecules for use as tags or tag complements wherein each molecule 
comprises an oligonucleotide selected from a set of oligonucleotides 
based on a group of sequences as set out in Table IA wherein each of 1 
10 to 3 is a nucleotide base selected to be different from the others of 1 
to 3 with the proviso that up to three nucleotide bases of each sequence 
can be substituted with any nucleotide base provided that: 
for any pair of sequences of the set: 
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Ml * 18, M2 s 16, M3 s 20, M4 <s 17, and M5 £ 19, where: 
Ml is the maximum number of matches for any alignment in which there 
are no internal indels; 

M2 is the maximum length of a block of matches for any alignment; 

M3 is the maximum number of matches for any alignment having a maximum 

score; 

M4 is the maximum sum of the lengths of the longest two blocks of 
matches for any alignment of maximum score; and 

M5 is the maximum sum of the lengths of all the blocks of matches 
having a length of at least 3, for any alignment of maximum score; 
wherein 

the score of an alignment iB determined according to the equation (A 
x m) - (B x mm) - (C x (og + eg) ) - (D x eg)), wherein: 
for each of (i) to (iv) : 

(i) m - 6, mm = 6, og = 0 and eg = 6, 

(ii) m = 6 , mm = 6 , og = 5 and eg « 1, 

(iii) m = 6, mm = 2, og = 5 and eg = 1, and 

(iv) m = 6 , mm = 6 , og = 6 and eg » 0, 

A is the total number of matched pairs of bases in the alignment; 
B is the total number of internal mismatched pairs in the 
alignment; 

C is the total number of internal gaps in the alignment; and 
D is the total number of internal indels in the alignment minus 
the total number of internal gaps in the alignment; and 
wherein the maximum score is determined separately for each of (i) , 
(ii) , (iii) and (iv) . 
In another broad aspect, the invention is a composition comprising 
molecules for use as tags or tag complements wherein each molecule 
comprises an oligonucleotide selected from a set of oligonucleotides 
based on a group of sequences set out in Table IA wherein each of 1 to 3 
5 is a nucleotide base selected to be different from the others of 1 to 3 
with the proviso that up to three nucleotide bases of each sequence can 
be substituted with any nucleotide base provided that: 
for any pair of sequences of the set : 

Ml s 18, M2 s 16, M3 s 20, M4 s 17, and M5 £ 19, where: 
Ml is the maximum number of matches for any alignment in which there 
are no internal indels; 

M2 is the maximum length of a block of matches for any alignment; 

M3 is the maximum number of matches for any alignment having a maximum 
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score; 

M4 is the maximum sum of the lengths of the longest two blocks of 
matches for any alignment of maximum score; and 

M5 is the maximum sum of the lengths of all the blocks of matches 
having a length of at least 3, for any alignment of maximum score, 
wherein: 

the score of an alignment is determined according to the equation 3A 
- B - 3C - D, wherein: 

A is the total number of matched pairs of bases in the alignment; 

B is the total number of internal mismatched pairs in the 

alignment; 

C is the total number of internal gaps in the alignment; and 
D is the total number of internal indels in the alignment minus 
the total number of internal gaps in the alignment; and 
In preferred aspects, the invention provides a composition in 
which, for the group of 24mer sequences in which 1 = A, 2 = T and 3 = G, 
under a defined set of conditions in which the maximum degree of 
hybridization between a sequence and any complement of a different 
5 sequence of the group of 24mer sequences does - not exceed 3 0% of the 
degree of hybridization between said sequence and its complement, for 
all said oligonucleotides of the composition, the maximum degree of 
hybridization between an oligonucleotide and a complement of any other 
oligonucleotide of the composition does not exceed 50% of the degree of 
10 hybridization of the oligonucleotide and its complement. 

More preferably, the maximum degree of hybridization between a 
sequence and any complement of a different sequence does not exceed 30% 
of the degree of hybridization between said sequence and its complement, 
the degree of hybridization between each- sequence and its complement 
15 varies by a factor of between 1 and up to 10, more preferably between 1 
and up to 9, more preferably between 1 and up to 8, more preferably 
between 1 and up to 7, more preferably between 1 and up to 6, and more 
preferably between 1 and up to 5 . 

It is also preferred that the maximum degree of hybridization 
20 between a sequence and any complement of a different sequence does not 
exceed 25%, more preferably does not exceed 20%, more preferably does 
not exceed 15%, more preferably does not exceed 10%, more preferably 
does not exceed 5%. 

Even more preferably, the above -referenced defined set of 
25 conditions results in a level of hybridization that is the same as the 
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level of hybridization obtained when hybridization conditions include 
0.2 M NaCl, 0.1 M Tris, 0.08% Triton X-100, pH 8.0 at 37°C. 

In the composition, the defined set of conditions can include the 
group of 24raer sequences being covalently linked to beads. 
5 In a particular preferred aspect, for the group of 24mers the 

maximum degree of hybridization between a sequence and any complement of 
a different sequence does not exceed 15% of the degree of hybridization 
between said sequence and its complement and the degree of hybridization 
between each sequence and its complement varies by a factor of between 1 
10 and up to 9, and for all oligonucleotides of the set, the maximum degree 
of hybridization between an oligonucleotide and a complement of any 
other oligonucleotide of the set does not exceed 20% of the degree of 
hybridization of the oligonucleotide and its complement. 

It is possible that each 1 is one of A, T/U, G and C; each 2 is 
15 one of A, T/U, G and C; and each 3 is one of A, T/U, G and C; and each 
of 1, 2 and 3 is selected so as to be different from all of the others 
of 1, 2 and 3. More preferably, 1 is A or T/U, 2 is A or T/U and 3 is G 
or C. Even more preferably, 1 is A, 2 is T/U, and 3 is G. 

In certain preferred composition, each of the oligonucleotides is 
20 from twenty-two to twenty-six bases in length, or from twenty-three to 
twenty- five, and preferably, each oligonucleotide is of the same length 
as every other said oligonucleotide. 

In a particularly preferred embodiment, each oligonucleotide is 
twenty- four bases in length. 
25 It is preferred that no oligonucleotide contains more than four 

contiguous bases that are identical to each other. 

It is also preferred that the number of G' s in each 
oligonucleotide does not exceed L/4 where L is the number of bases in 
said sequence. 

30 For reasons described below, the number of G' s in each said 

oligonucleotide is preferred not to vary from the average number of G' s 
in all of the oligonucleotides by more than one. Even more preferably, 
the number of G' s in each said oligonucleotide is the same as every 
other said oligonucleotide. In the embodiment disclosed below in which 

35 oligonucleotides were tested, the sequence of each was twenty-four bases 
in length and each oligonucleotide contained 6 G's. 

It is also preferred that, for each nucleotide, there is at most 
six bases other than G between every pair of neighboring pairs of G's. 

Also, it is preferred that, at the 5' -end of each oligonucleotide 

40 at least one of the first, second, third, fourth, fifth, sixth and 
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seventh bases of the sequence of the oligonculeotide is a G. Similarly, 
it is preferred, at the 3' -end of each oligonucleotide that at least one 
of the first, second, third, fourth, fifth, sixth and seventh bases of 
the sequence of the oligonucleotide is a G. 

It is possible to have sequence compositions that include one 
hundred and sixty said molecules, or that include one hundred and 
seventy said molecules, or that include one hundred and eighty said 
molecules, or that include one hundred and ninety said molecules, or 
that include two hundred said molecules, or that include two hundred and 
twenty said molecules, or that include two hundred and forty said 
molecules, or that include two hundred and sixty said molecules, or that 
include two hundred and eighty said molecules, or that include three 
hundred said molecules, or that include four hundred said molecules, or 
that include five hundred said molecules, or that include six hundred 
said molecules, or that include seven hundred said molecules, or that 
include eight hundred said molecules, or that include nine hundred said 
molecules, or that include one thousand said molecules. 

It is possible, in certain applications, for each molecule to be 
linked to a solid phase support so as to be distinguishable from a 
mixture containing other of the molecules by hybridization to its 
complement. Such a molecule can be linked to a defined location on a 
solid phase support such that the defined location for each molecule is 
different than the defined location for different others of the 
molecules. 

In certain embodiments, each solid phase support is a 
microparticle and each said molecule is covalently linked to a different 
microparticle than each other different said molecule. 

In another broad aspect, the invention is a composition comprising 
a set of 150 molecules for use as tags or tag complements wherein each 
molecule includes an oligonucleotide having a sequence of at least 
sixteen nucleotide bases wherein for any pair of sequences of the set: 

, Ml > 19/24 x LI, M2 > 17/24 X LI, M3 > 21/24 x LI, M4 > 18/24 x LI, M5 > 

20/24 x LI, where LI is the length of the shortest sequence of the pair, 

where : 

Ml is the maximum number of matches for any alignment of the pair of 
sequences in which there are no internal indels; 

M2 is the maximum length of a block of matches for any alignment of the 
pair of sequences; 

M3 is the maximum number of matches for any alignment of the pair of 
sequences having a maximum score; 
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M4 is the maximum sum of the lengths of the longest two blocks of 
matches for any alignment of the pair of sequences of maximum score; 
and 

M5 is the maximum sum of the lengths of all the blocks of matches 
having a length of at least 3, for any alignment of the pair of 
sequences of maximum score, wherein: 

the score of an alignment is determined according to the equation (A 
x m) - (B x mm) - (C x (og + eg) ) - (D x eg) ) , wherein: 
for each of (i) to (iv) : 

(i) m « 6, mm = 6, og = 0 and eg = 6, 

(ii) m = 6, mm = 6, og =5 and eg = 1, 

(iii) m = 6, mm = 2, og = 5 and eg = 1, and 

(iv) m = 6, mm = 6, og = 6 and eg = 0, 

A is the total number of matched pairs of bases in the alignment; 
B is the total number of internal mismatched pairs in the 
alignment; 

C is the total number of internal gaps in the alignment; and 
D is the total number of internal indels in the alignment minus 
the total number of internal gaps in the alignment; and 
wherein the maximum score is determined separately for each of (i) , 
(ii) , (iii) and (iv) . 
In yet another broad aspect, the invention is a composition that 
includes a set of 150 molecules for use as tags or tag complements 
wherein each molecule has an oligonucleotide having a sequence of at 
least sixteen nucleotide bases wherein for any pair of sequences of the 
5 set: 

Ml * 18, M2 s 16, M3 z 20, M4 £ 17, and M5 <; 19, where: 
Ml is the maximum number of matches for any alignment of the pair of 
sequences in which there are no internal indels ; 

M2 is the. maximum length of a block of matches for any alignment of the 
pair of sequences; 

M3 is the maximum number of matches for any alignment of the pair of 
sequences having a maximum score; 

M4 is the maximum sum of the lengths of the longest two blocks of 
matches for any alignment of the pair of sequences of maximum score; 
and 

M5 is the maximum sum of the lengths of all the blocks of matches 
having a length of at least 3, for any alignment of the pair of 
sequences of maximum score, wherein: 

the score of a said alignment is determined according to the equation 
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3A - B - 3C - D, wherein: 

A is the total number of matched pairs of bases in the alignment; 
B is the total number of internal mismatched pairs in the 
alignment; 

C is the total number of internal gaps in the alignment; and 
D is the total number of internal indels in the alignment minus 
the total number of internal gaps in the alignment. 
In certain embodiments of the invention, each sequence of a 
composition has up to fifty bases. More -preferably, however, each 
sequence is between sixteen and forty bases in length, or between 
sixteen and thirty-five bases in length, or between eighteen and thirty 
bases in length, or between twenty and twenty-eight bases in length, or 
between twenty-one and twenty- seven bases in length, or between twenty- 
two and twenty- six bases in length. 

* Often, each sequence is of the same length as every other said 
sequence. In particular embodiments disclosed herein, each sequence is 
twenty- four bases in length. 

Again, it can be preferred that no sequence contains more than 
four contiguous bases that are identical to each other, etc., as 
described above. 

In certain preferred embodiments, the composition is such that, 
under a defined set of conditions, the maximum degree of hybridization 
between an oligonucleotide and any complement of a different 
oligonucleotide of the composition does not exceed about 30% of the 
degree of hybridization between said oligonucleotide and its complement, 
more preferably 20%, more preferably 15%, more preferably 10%, more 
preferably 6%. 

Preferably, the set of conditions results in a level of 
hybridization that is the same as the level of hybridization obtained 
when hybridization conditions include 0.2 M NaCl, 0 . 1 M Tris, 0.08% 
Triton X-100, pH 8.0 at 37 °C, and the oligonucleotides are covalently 
linked to mi cropar tides . Of course it is possible that these specific 
conditions be used for determining the level of hybridization. 

It is also preferred that under such a defined set of conditions, 
the degree of hybridization between each oligonucleotide and its 
complement varies by a factor of between 1 and up to 8, more preferably 
up to 7, more preferably up to 6, more preferably up to 5 . In a 
particular disclosed embodiment, the observed variance in the degree of 
hybridization was a factor of only 5.3, i.e., the degree of 
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hybridization between each oligonucleotide and its complement varied by 
a factor of between 1 and 5.6. 

In certain preferred embodiments, under the defined set of 
conditions, the maximum degree of hybridization between a said 
oligonucleotide and any complement of a different oligonucleotide of the 
composition does not exceed about 15%, more preferably 10%, more 
preferably 6%. 

In one preferred embodiment, the set of conditions results in a 
level of hybridization that is the same as the level of hybridization 
obtained when hybridization conditions include 0.2 M NaCl, 0.1 M Tris, 
0.08% Triton X-100, pH 8.0 at 37 °C, and the oligonucleotides are 
covalently linked to microparticles . 

Also, under the defined set of conditions, it is preferred that 
the degree of hybridization between each oligonucleotide and its 
complement varies by a factor of between 1 and up to 8, more preferably 
up to 7, more preferably up to 6, more preferably up to 5. 

Any composition of the invention can include one hundred and sixty 
of the oligonucleotide molecules, or one hundred and seventy of the 
oligonucleotide molecules, or one hundred and eighty of the 
oligonucleotide molecules, or one hundred and ninety of the 
oligonucleotide molecules, or two hundred of the oligonucleotide 
molecules, or two hundred and twenty of the oligonucleotide molecules, 
or two hundred and forty of the oligonucleotide molecules, or two 
hundred and sixty of the oligonucleotide molecules, or two hundred and 
eighty of the oligonucleotide molecules, or three hundred of the 
oligonucleotide molecules, or four hundred of the oligonucleotide 
molecules, or five hundred of the oligonucleotide molecules, or six 
hundred of the oligonucleotide molecules, or seven hundred of the 
oligonucleotide molecules, or eight hundred of the oligonucleotide 
molecules, or nine hundred of the oligonucleotide molecules, or one 
thousand or more of the oligonucleotide molecules. 

A composition of the invention can be a family of tags, or it can 
be a family of tag complements. 

An oligonucleotide molecule belonging to a family of molecules of 
the invention can have incorporated thereinto one more analogues of 
nucleotide bases, preference being given those that undergo normal 
Watson-Crick base pairing. 

The invention includes kits for sorting and identifying 
polynucleotides. Such a kit can include one or more solid phase 
supports each having one or more spatially discrete regions, each such 
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' region having a uniform population of substantially identical tag 

complements covalently attached. The tag complements are made up of a 
set of oligonucleotides of the invention. 

The one or more solid phase supports can be a planar substrate in 
5 which the one or more spatially discrete regions is a plurality of 
spatially addressable regions. 

The tag complements can also be coupled to microparticles . 
Microparticles preferably each have a diameter in the range of from 5 to 
40 pm. 

10 Such a kit preferably includes microparticles that are 

spectrophotometrically unique, and therefore distingui sable from each 
other according to conventional laboratory techniques . Of course for 
such kits to work, each type of microparticle would generally have only 
one tag complement associated with it, and usually there would be a 

15 different oligonucleotide tag complement associated with (attached to) 
each type of microparticle. 

The invention includes methods of using families of 
oligonucleotides of the invention. 

One such method is of analyzing a biological sample containing a 

20 biological sequence for the presence of a mutation or polymorphism at a 
locus of the nucleic acid. The method includes: 

(A) amplifying the nucleic acid molecule in the presence of a first primer 
having a 5' -sequence having the sequence of a tag complementary to the 
sequence of a tag complement belonging to a family of tag complements 
of the invention to form an amplified molecule with a 5' -end with a 
sequence complementary to the sequence of the tag; 

(B) extending the amplified molecule in the presence of a polymerase and a 
second primer having 5' -end complementary the 3' -end of the amplified 
sequence, with the 3' -end of the second primer extending to immediately 
adjacent said locus, in the presence of a plurality of nucleoside 
triphosphate derivatives each of which is: .(i) capable of 
incorporation during transciption by the polymerase onto the 3' -end of 
a growing nucleotide strand; (ii) causes termination of polymerization; 
and (iii) capable of differential detection, one from the other, 
wherein there is a said derivative complementary to each possible 
nucleotide present at said locus of the amplified sequence; 

(C) specifically hybridizing the second primer to a tag complement having 
the tag complement sequence of (A) ; and 

(D) detecting the nucleotide derivative incorporated into the second primer 
in (B) so as to identify the base located at the locus of the nucleic 
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acid. 

In another method of the invention, a biological sample containing 
a plurality of nucleic acid molecules is analyzed for the presence of a 
mutation or polymorphism at a locus of each nucleic acid molecule, for 
each nucleic acid molecule. This method includes steps of: 

(A) amplifying the nucleic acid molecule in the presence of a first primer 
having a 5' -sequence having the sequence of a tag complementary to the 
sequence of a tag complement belonging to a family of tag complements 
of the invention to form an amplified molecule with a 5 '-end with a 
sequence complementary to the sequence of the tag; 

(B) extending the amplified molecule in the presence of a polymerase and a 
second primer having 5' -end complementary the 3' -end of the amplified 
sequence,, the 3' -end of the second primer extending to immediately 
adjacent said locus, in the presence of a plurality of nucleoside 
triphosphate derivatives each of which is: (i) capable of 
incorporation during transciption by the polymerase onto the 3' -end of 
a growing nucleotide strand; (ii) causes termination of polymerization; 
and (iii) capable of differential detection, one from the other, 
wherein there is a said derivative complementary to each possible 
nucleotide present at said locus of the amplified molecule; 

(C) specifically hybridizing the second primer to a tag complement having 
the tag complement sequence of (A) ; and 

(D) detecting the nucleotide derivative incorporated into the second primer 
in (B) so as to identify the base located at the locus of the nucleic 
acid; 

wherein each tag of (A) is unique for each nucleic acid molecule and steps 
(A) and (B) are carried out with said nucleic molecules in the presence of 
each other. 

Another method includes analyzing a biological sample that 
contains a plurality of double stranded complementary nucleic acid 
molecules for the presence of a mutation or polymorphism at a locus of 
each nucleic acid molecule, for each nucleic acid molecule. The method 
includes steps of: 

(A) amplifying the double stranded molecule in the presence of a pair of 
first primers, each primer having an identical 5' -sequence having the 
sequence of a tag complementary to the sequence of a tag complement 
belonging to a family of tag complements of the invention to form 
amplified molecules with 5' -ends with a sequence complementary to the 
sequence of the tag; 

(B) extending the amplified molecules in the presence of a polymerase and a 
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pair of second primers each second primer having a 5' -end' complementary 
a 3' -end of the amplified sequence, the 3' -end of each said second 
primer extending to immediately adjacent said locus, in the presence of 
a plurality of nucleoside triphosphate derivatives each of which is: 
(i) capable of incorporation during trans cipt ion by the polymerase onto 
the 3 '-end of a growing nucleotide strand; (ii) causes termination of 
polymerization; and (iii) capable of differential detection, one from 
the other; 

(C) specifically hybridizing each of the second primers to a tag complement 
having the tag complement sequence of (A) ; and 

(D) detecting the nucleotide derivative incorporated into the second 
primers in (B) so as to identify the base located at said locus; 

wherein the sequence of each tag of (A) is unique for. each nucleic acid 
molecule and steps (A) and (B) are carried out with said nucleic molecules in 
the presence of each other. 

In yet another aspect, the invention is a method of analyzing a 
biological sample containing a plurality of nucleic acid molecules for 
the presence of a mutation or polymorphism at a locus of each nucleic 
acid molecule, for each nucleic acid molecule, the method including 
5 steps of : 

(a) hybridizing the molecule and a primer, the primer having a 5 ' -sequence 
having the sequence of a tag complementary to the sequence of a tag 
complement belonging to a family of tag complements of the invention 
and a 3' -end extending to immediately adjacent the locus; 

(b) enzymatically extending the 3 f -end of the primer in the presence of a 
plurality of nucleoside triphosphate derivatives each of which is: (i) 
capable of enzymatic incorporation onto the 3' -end of a growing 
nucleotide strand; (ii) causes termination of said extension; and (iii) 
capable of differential detection, one from the other, wherein there is 
a said derivative complementary to each possible nucleotide present at 
said locus ; 

(c) specifically hybridizing the extended primer formed in step (b) to a 
tag complement having the tag complement sequence of (a) ; and 

(d) detecting the nucleotide derivative incorporated into the primer in 
step (b) so as to identify the base located at the locus of the nucleic 
acid molecule; 

wherein each tag of (a) is unique for each nucleic acid molecule and steps ' 
(a) and (b) are carried out with said nucleic molecules in the presence of 
each other. 

The derivative can be a dideoxy nucleoside triphosphate. 
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Each respective complement can be attached as a uniform population 
of substantially identical complements in spacially discrete regions on 
one or more solid phase support (s). 

Each tag complement can include a label, each such label being 
different for respective complements, and step (d) can include detecting 
the presence of the different labels for respective hybridization 
complexes of • bound tags and tag complements. 

Another aspect of the invention includes a method of determining 
the presence of a target suspected of being contained in a mixture. The 
method includes the steps of: 

(i) labelling the target with a first label; 

(ii) providing a first detection moiety capable of specific binding to the 
target and including a first tag; 

(iii) exposing a sample of the mixture to the detection moiety under 
conditions suitable to permit (or cause) said specific binding of the 
molecule and target; 

(iv) providing a family of suitable tag complements of the invention wherein 
the family contains a first tag complement having a sequence 
complementary to that of the first tag; 

(v) exposing the sample to the family of tag complements under conditions 
suitable to permit (or cause) specific hybridization of the first tag 
and its tag complement; 

(vi) determining whether a said first detection moiety hybridized to a first 
said tag complement is bound to a said labelled target in order to 
determine the presence or absence of said target in the mixture. 

Preferably , the first tag complement is linked to a solid support 
at a specific location of the support and step (vi) includes detecting 
the presence of the first label at said specified location. 

Also, the first tag complement can include a second label and step 
(vi) includes detecting the presence of the first and second labels in a 
hybridized complex of the moiety and the first tag complement. 

Further, the target can be selected from the group consisting of 
organic molecules, antigens, proteins, polypeptides, antibodies and 
nucleic acids. The target can be an antigen and the first molecule can 
be an antibody specific for that antigen. 

The antigen is usually a polypeptide or protein and the labelling 
step can include conjugation of fluorescent molecules', digoxigenin, 
biotinylation and the like. 
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The target can be a nucleic acid and the labelling step can 
include incorporation of fluorescent molecules, radiolabelled 
nucleotide, digoxigenin, biotinylation and the like. 



5 DETAILED DESCRIPTION OF THE INVENTION 
FIGURES 

Reference is made to the attached figures in which, 

Figure 1 illustrates generally the steps followed to obtain a family of 
sequences of the present invention; 
10 Figure 2 shows the intensity of the signal (MFI) for each perfectly 

matched sequence (probe sequences indicated in Table I) and its complement 
(target, at 50 fmol) obtained as described in Example 1; 

Figure 3 is a three dimensional representation showing cross- 
hybridization observed for the sequences of Figure 2 as described in 
15 Example 1. The results shown in Figure 2 are reproduced along the 
diagonal of the drawing; and 

. Figure 4 is illustrative of results obtained for an individual 
target (SEQ ID NO: 90, target No. 90) when exposed to the 100 probes of 
Example 1. The MFI for each bead is plotted. 

20 

DETAILED EMBODIMENTS 

The invention provides a method for sorting complex mixtures of 
molecules by the use of families of oligonucleotide sequence tags. The 
families of oligonucleotide sequence tags are designed so as to provide 
25 minimal cross hybridization during the sorting process . Thus any sequence 

within a family of sequences will not significantly cross -hybridize with any 
other sequence derived from that family under appropriate hybridization 
conditions known by those skilled in the art. The invention is particularly 
useful in highly parallel processing of analytes. 

30 

Families of Oligonucleotide Sequence Tags 

The present invention includes a family of 24mer polynucleotides that 
have been demonstrated to be minimally cross -hybridizing with each other. 
This family of polynucleotides is thus useful as a family of tags, and their 

35 complements as tag complements. 

In order to be considered for inclusion pinto the family, a sequence had 
to satisfy a certain number of rules regarding its composition. For example, 
repetitive regions that present potential hybridization problems such as four 
or more of a similar base (e.g., AAAA or TTTT) or pairs of Gs were forbidden. 

40 Another rule is that each sequence contains exactly six Gs and no Cs, in order 
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to have sequences that are more or less isothermal. Also required for a 24mer 
to be included is that there must be at most six bases between every 
neighboring pair of Gs. Another way of putting this is that there are at most 
six non-Gs between any two consecutive Gs. Also, each G nearest the 5' -end 
5 (resp. 3'-end) of its oligonucleotide (the left-hand (resp. right-hand) side 
as written in Table I) was required to occupy one of the first to seventh 
positions (counting the S'-terminal (resp. 3 f -terminal ) position as the first 
position. ) 

The process used to design families of sequences that do not exhibit 

10 cross -hybridization behavior is illustrated generally in Figure 1) . Depending 
on the application for which these families of sequences will be used, various 
rules are designed. A certain number of rules can specify constraints for 
sequence composition (such as the ones described in the previous paragraph) . 
The other rules are used to judge whether two sequences are too similar. 

15 Based on these rules, a computer program can derive families of sequences that 
exhibit minimal or no cross-hybridization behavior. The exact method used by 
the computer program is not crucial since various computer programs can derive 
similar families based on these rules. Such a program is for example described 
in international patent application No. PCT/CA 01/00141 published under 

20 WO 01/59151 on August 16, 2001. Other programs can use different methods, 
such as the ones summarized below. 

A first method of generating a maximum number of minimally cross- 
hybridizing polynucleotide sequences starts with any number of non-cross - 
hybridizing sequences, for example just one sequence, and increases the family 

25 as follows. A certain number of sequences is generated and compared to the 

sequences already in the family. The generated sequences that exhibit too much 
similarity with sequences already in the family are dropped. Among the 
" candidate sequences" that remain, one sequence is selected and added to the 
family. The other . candidate sequences are then compared to the selected 

30 sequence, and the ones that show too much similarity are dropped. A new 

sequence is selected from the remaining candidate sequences, if any, and added 
to the family, and so on until there are no candidate sequences left. At this 
stage, the process can be repeated (generating a certain number of sequences 
and comparing them to the sequences in the family, etc.) as often as desired. 

35 The family obtained at the end of this method contains only minimally cross- 
hybridizing sequences . 

A second method of generating a maximum number of minimally cross- 
hybridizing polynucleotide sequences starts with a fixed- size family of 
polynucleotide sequences. The sequences of this family can be generated 

40 randomly or designed by some other method. Many sequences in this family may 



WO 02/059355 



PCT/CA02/00089 



- 41 - 

not be compatible with each other, because they show too much similarity and 
are not minimally cross -hybridizing. Therefore, some sequences need to be 
replaced by new ones, with less similarity'. One way to achieve this consists 
of repeatedly replacing a sequence of the family by the best (that is, lowest 
similarity) sequence among a certain number of (for example, randomly 
generated) sequences that are not part of the family. This process can be 
repeated until the family of sequences shows minimal similarity, hence minimal 
cross -hybridizing, or until a set number of replacements has occurred. If, at 
the end of the process, some sequences do not obey the similarity rules that 
have been set, they can be taken out of the family, thus providing a somewhat 
smaller family that only contains minimally cross -hybridizing sequences. Some 
additional rules can be added to this method in order to make it more 
efficient, such as rules to determine which sequence will be replaced. 

Such methods have been used to obtain the 1168 non- cross -hybridizing 
tags of Table I that are the subject of this patent application. 

One embodiment of the invention is a composition comprising 
molecules for use as tags or tag complements wherein each molecule 
comprises an oligonucleotide selected from a set of oligonucleotides 
based on the group of sequences set out in Table IA, wherein each of the 
numeric identifiers 1 to 3 (see the Table) is a nucleotide base selected 
to be different from the others of 1 to 3 . According to this 
embodiment, several different families of specific sets of 
oligonucleotide sequences are described, depending upon the assignment 
of bases made to the numeric identifiers .1 to 3. 

The sequences contained in Table I have a mathematical 
relationship to each other, described as follows. 

Let S and T be two DNA sequences of. lengths s and t respectively. 
While the term "alignment" of nucleotide sequences is widely used in the 
field of biotechnology, in the context of this invention the term has a 
specific meaning illustrated here. An alignment of S and T is a 2xp 
matrix A (with p > s and p > t) such that the first (or second) row of A 
contains the characters of S (or T respectively) in order, interspersed 
with p-s (or p-t respectively) spaces. It assumed that no column of the 
alignment matrix contains two spaces, i.e., that any alignment in which 
a column contains two spaces is ignored and not considered here. The 
columns containing the same base in both rows are called matches, while 
the columns containing different bases are called mismatches. Each 
column of an alignment containing a space in its first row is called an 
insertion and each colmun containing a space in its second row is called 
a deletion while a column of the alignment containing a space in either 
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row is called an indel. Insertions and deletions within a sequence are 
represented by the character . A gap is a continuous sequence of 
spaces in one of the rows (that is neither immediately preceded nor 
immediately followed by another space in the same row) , and the length 
of a gap is the number of spaces in that gap. An internal gap is one in 
which its first space is preceded by a base and its last space is 
followed by a base aiid an internal indel is an indel belonging to an 
internal gap. Finally, a block is a continuous sequence of matches 
(that is neither immediately preceded nor immediately followed by 
another match) , and the length of a block is the number of matches in 
that block. In order to illustrate these definitions, two sequences S = 
TGATCGTAGCTACGCCGCG (of length s = 19; SEQ ID NO: 1169) and T = 
CGTACGATTGCAACGT (of length t « 16; SEQ ID NO: 1170) are considered. 
Exemplary alignment R x of S and T (with p = 23) is: 



Alignment R x : 











T 


G 


A 


T 


C 


G 


T 


A 


G 


C 


T 


A 


C 


G 


C 


C 


G 


C 


G 


c 


G 


T 


A 


C 


G 


A 


T 






T 




G 


C 


A 


A 


C 


G 


T 










1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


16 


17 


18 


19 


20 


21 


22 


23 



Columns 1 to 4, 9, 10, 12 and 20 to 23 are indels, columns 6, 7, 
8, 11, 13, 14, 16, 17 and 18 are matches, and columns 5, 15 and 19 are 
mismatches. Columns 9 and 10 form a gap of length 2, while columns 16 
to 18 form a block of length 3. Columns 9, 10 and 12 are internal 
indels . 

A score is assigned to the alignment A of two sequences by 
assigning weights to each of matches, mismatches and gaps as follows: 

• the reward for a match m, 

• the penalty for a mismatch mm, 

• the penalty for opening a gap og, 

• the penalty for extending a gap eg. 

Once these values are set, a score to each column of the alignment is 
assigned according to the following rules: 

1- assign 0 to each column preceding the first match and to 

each column following the last match. 
2. for each of the remaining columns, assign m if it is a 

match, -mm if it is a mismatch, -og-eg if it is the first indel 



WO 02/059355 



PCT/CA02/00089 



- 43 - 

of a gap, -eg if it is an indel but not the first indel of a 
gap. 

The score of the alignment A is the sum of the scores of its columns. An 
alignment is said to be of maximum score if no other alignment of the 
same two sequences has a higher score (with the same values of m, mm, og 
and eg) . A person knowledgeable in the field will recognize this method 
of scoring an alignment as scoring a local (as opposed to global) 
alignment with affine gap penalties (that is, gap penalties that can 
distinguish between the first indel of a gap and the other indels) . It 
will be appreciated that the total number of indels that open a gap is 
the same as the total number of gaps and that an internal indel is not 
one of those assigned a 0 in rule (1) above. It will also be noted that 
foregoing rule (1) assigns a 0 for non-internal mismatches. An internal 
mismatch is a mismatch that is preceded and followed (not necessarily 
immediately) by a match. 

As an illustration, if the values of in, mm, og and eg are set to 
3, 1, 2 and 1 respectively, alignment R x has a score of 19, determined 
as shown below: 



Scoring of Alignment R x 
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Note that for two given sequences S and T, there are numerous 
alignments. There are often several alignments of maximum score. 

Based on these alignments, five sequence similarity measures are 
defined as follows. For two sequences S and T, and weights {m, mm, og, 
eg }: 

• Ml is the maximum number of matches over all alignments free of 
internal indels; 

• M2 is the maximum length of a block over all alignments; 

• M3 is the maximum number of matches over all alignments of maximum 
score; 

• M4 is the maximum sum of the lengths of the longest two blocks 
over all alignments of maximum score; 
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• MS is the maximum sum of the lengths of all the blocks of length 
at least 3, over all alignments of maximum score. 

Notice that, by definition, the following inequalities between these 
similarity measures are obtained: M4 < M3 and MS £ M3. Also, in order 
to determine M2 it is sufficient to determine the maximum length of a 
block over all alignments free of internal indels. For two given 
sequences, the values of M3 to M5 can vary depending on the values of 
the weights {in, mm, og, eg }, but not Ml and M2. 

For weights {3, 1, 2, l}, the illustrated alignment is not a 
maximum score alignment of the two example sequences. But for weights 
{6, 6, 0, 6} it is; hence this alignment shows that for these two . 
example sequences, and weights {6, 6, 0, 6}, M2 > 3, M3 > 9, M4 > 6 and 
MS > 6. In order to determine the exact values of Ml to M5 t all the 
necessary alignments need to be considered. Ml and M2 can be found by 
looking at the s+t-1 alignments free of internal indels, where s and t 
are the lengths of the two sequences considered. Mathematical tools 
known as dynamic programming can be implemented on a computer and used 
to determine M3 to MS in a very quick way. Using a computer program to 
do these calculations, it was determined that: 

• with the weights {6, 6, 0, 6}, Ml = 8, M2 = 4, M3> = 10, M4 o 6 and 
M5 = 6} 

• with the weights {3, 1, 2, 1} , Ml = 8, M2 m 4, M3 = 10, M4 « 6 and 
M5 o 4. 

According to the preferred embodiment of this invention, two sequences S 
and T each of length 24 are too similar if at least one of the following 
happens : 

• Ml > 16 or 

• M2 > 13 or 

• M3 > 20 or 

• M4 > 16 or 

• M5 > 19 

when using either weights {6, 6, 0, 6}, or {6, 6, 5, l}, or {6, 2, 5, 
l}, or {6, 6, 6, 0}. In other words, the five similarity measures 
between S and T are determined for each of the above four sets of 
weights, and checked against these thresholds (for a total of 20 tests) . 

■ The above thresholds of 16, 13, 20, 16 and 19, and the above sets 
of weights, were used to obtain the sequences listed in Table I. 
Additional sequences can thus be added to those of Table I as long as 
the above alignment rules are obeyed for all sequences. 
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It is also possible to alter thresholds Ml, M2, etc., while 
remaining within the scope of this invention. It is thus possible to 
substitute or add sequences to those of Table I, or more generally to 
those of Table IA to obtain other sets of sequences that would also 
5 exhibit reasonably low cross -hybridization. More specifically, a set of 
24mer sequences in which there are no two sequences that are too 
similar, where too similar is defined as: 

• Ml > 19 or 

• M2 > 17 or 
10 • M3 > 21 or 

• M4 > 18 or 

• M5 > 20 

when using either weights {6, 6, 0, 6}, or {6, 6, 5, l}, or {6, 2, 5, 
l}, or {6, 6, 6, 0}, would also exhibit low cross-hybridization. 
15 Reducing any of the threshold values provides sets of sequences with 

even lower cross-hybridization. Alternatively, *too similar' can also be 
defined as: 

• Ml > 19 or 

• M2 > 17 or 
20 • M3 > 21 or 

• M4 > 28 or 

• MS > 20 

when using either weights {3, 1, 2, 1} . Alternatively, other 
combinations of weights will lead to sets of sequences with low cross- 

25 hybridization. 

Notice that using weights {6, 6, 0, 6} is equivalent to using 
weights {1, 1, 0, 1}, or weights {2, 2, 0, 2}, ... (that is, for any two 
sequences, the values of Ml to M5 are exactly the same whether weights 
{6, 6, 0, 6} or {l, 1, 0, 1} or {2, 2,0, 2} or any other multiple of . 

30 {1, 1, 0, 1} is used) . 

When dealing with sequences of length other than 24, or sequences 
of various lengths, the definition of similarity can be adjusted. Such 
adjustments are obvious to the persons skilled in the art. For example, 
when comparing a sequence of length LI with a sequence of length L2 

35 (with L1<L2) , they can be considered as. too similar when 
Ml > 19/24 x LI 
M2 > 17/24 x LI 
M3 > 21/24 x LI 
M4 > 18/24 x LI 
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M5 > 20/24 x LI 

when using either weights {6, 6, 0, 6}, or {6, 6, 5, l}, or {6, 2, 5, l} 
or {6, 6, 6, 0}. 

Polynucleotide sequences can be composed of a subset of natural bases 
most preferably A, T and G. Sequences that are deficient in one base possess 
useful characteristics, for example, in reducing potential secondary structure 
formation or reduced potential for cross hybridization with nucleic acids in 
nature. Also, it is preferable to have tag sequences that behave 
isothermally . This can be achieved for example by maintaining a constant- base 
composition for all sequences such as six Gs and eighteen As or Ts for each 
sequence. Additional sets of sequences can be designed by extrapolating on 
the original family of non- cross -hybridizing sequences by simple methods known 
to those skilled in the art. 

In order to validate the sequence set, a subset of sequences from 
the family of 1168 sequence tags was selected and characterized, in 
terms of the ability of these sequences to form specific duplex 
structures with their complementary sequences, and the potential' for 
cross -hybridization within the sequence set. See Example 1, below. The 
subset of 100 sequences was randomly selected, and analyzed using the 
Luminex 100 LabMAP™ platform. The 100 sequences were chemically 
immobilized onto the set of 100 different Luminex microsphere 
populations, such that each specific sequence was coupled to one 
spectrally distinct microsphere population. The pool of 100 
microsphere-immobilized probes was then hybridized with each of the 100 
corresponding complementary sequences. Each sequence was examined 
individually for its specific hybridization with its complementary 
sequence, as well as for its non-specific hybridization with the other 
99 sequences present in the reaction. This analysis demonstrated the 
propensity of each sequence to hybridize only to its complement (perfect 
match) , and not to cross -hybridize appreciably with any of the other 
oligonucleotides present in the hybridization reaction. 

It is within the capability of a person skilled in the art, given the 
family of sequences of Table I, to modify the sequences, or add other 
sequences while largely retaining the property of minimal cross -hybridization 
which the polynucleotides of Table I have been demonstrated to have. 

There are 1168 polynucleotide sequences given in Table I. Since all 
1168 of this family of polynucleotides can work with each other as a minimally 
cross-hybridizing set, then any plurality of polynucleotides that is a subset 
of the 1168 can also act as a minimally cross -hybridizing set of 
polynucleotides. An application in which, for example, 3 0 molecules are to be 
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sorted using a family of polynucleotide tags and tag complements could thus 
use any group of 30 sequences shown in Table I. This is not to say that some 
subsets may be found in a practical sense to be more preferred than others. 
For example, it may be found that a particular subset is more tolerant of a 
wider variety of conditions under which hybridization is conducted before the 
degree of cross -hybridization becomes unacceptable. 

It may be desirable to use polynucleotides that are shorter in length 
than the 24 bases of those in Table I. A family of subsequences (i.e., 
subframes of the sequences illustrated) based on those contained in Table I 
having as few as 10 bases per sequence could be chosen, so long as the 
subsequences are chosen to retain homological properties between any two of 
the sequences of the family important to their non cross -hybridization. 

The selection of sequences using this approach would be amenable to a 
computerized process. Thus for example, a string of 10 contiguous bases of 
the first 24mer of Table I could be selected: AAATTGTGAA AGATTGTTTGTGTA (SEQ ID 
N0:1) . 

The same string of contiguous bases from the second 24mer could then be 
selected and compared for similarity against the first chosen sequence: 
GTTAGAGTTAA TTGTATTTGATGA (SEQ ID NO:2). A systematic pairwise comparison 
could then be carried out to determine if the similarity requirements are 
violated. If the pair of sequences does not violate any set property, a lOmer 
subsequence can be selected from the third 24mer sequence of Table I, and 
compared to each of the first two lOmer sequences (in a pairwise fashion to 
determine its compatibility therewith, etc. In this way a family of lOmer 
sequences may be developed. 

It is within the scope of this invention, to obtain families of 
sequences containing llmer, 12mer, 13mer, 14mer, 15mer, 16mer, 17mer, 18mer, 
19mer, 20mer, 21mer, 22mer and 23mer sequences by analogy to that shown for 
lOmer sequences. 

It may be desirable to have a family of sequences in which there are 
sequences greater in length than the 24mer sequences shown in Table I. It is 
within the capability of a person skilled in the art, given the family of 
sequences shown in Table I, to obtain such a family of sequences. One 
possible approach would be to insert into each sequence at one or more 
locations a nucleotide, non-natural base or analogue such that the longer 
sequence should not have greater similarity than any two of the original non- 
cross -hybridizing sequences of Table I and the addition of extra bases to the 
tag sequences should not result in a major change in the thermodynamic 
properties of the tag sequences of that set for example the GC content must be 
maintained between 10%-40% with a variance from the average of 20%. This 
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method of inserting bases could be used to obtain, for example, a family of 
sequences up to 40 bases long. 

Given a particular family of sequences that can be used as a family of 
tags (or tag complements), e.g., those of Table I, a skilled person will 
readily recognize variant families that work equally as well. 

Again taking the sequences of Table I for example, every T could be 
converted to an A and vice versa and no significant change in the cross - 
hybridization properties would be expected to be observed. This would also be 
true if every G were converted to a C. 

Also, all of the sequences of a family could be taken to be constructed 
in the 5 ! -3' direction, as. is the convention, or all of the constructions of 
sequences could be in the opposition direction (3' -5'). 

There are additional modifications that can be carried out . For 
example, C has not been used in the family of sequences. Substitution of C in 
place of one or more G' s of a particular sequence would yield a sequence that 
is at least as low in homology with every other sequence of the family as was 
the particular sequence chosen for modification. It is thus possible to 
substitute C in place of one or more G's in any of the sequences shown in 
Table I. Analogously, substituting of C in place of one or more A' s is 
possible, or substituting C in place of one or T's is possible. 

It is preferred that the sequences of a given family are of the same, or 
roughly the same length. Preferably, all the sequences of a family of 
sequences of this invention have a length that is within five bases of the 
base-length of the average of the family. More preferably, all sequences are 
within four bases of the average base-length. Even more preferably, all or 
almost all sequences are within- three bases of the average base-length of the 
family. Better still, all or almost all sequences have a length that is 
within two of the base-length of the average of the family, and even better 
still, within one of the base-length of the average of the family. 

It is also possible for a person skilled in the art to derive sets of 
sequences from the family of sequences described in this specification and 
remove sequences that would be expected to have undesirable hybridization 
properties . 

Methods For Synthesis Of Oligonucleotide Families 

Preferably oligonucleotide sequences of the invention are 
synthesized directly by standard phosphoramidite synthesis approaches 
and the like (Caruthers et al, Methods in Enzymology; 154, 287-313: 
1987/ Lipshutz et al, Nature Genet.; 21, 20-24: 1999/ Fodor et al, 
Science; 251, 763-773: 1991). Alternative chemistries- involving non 
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natural bases such as peptide nucleic acids or modified nucleosides that 
offer advantages in duplex stability may also be used (Hacia et al; 
Nucleic Acids Res ;27: 4034-4039, 1999; Nguyen et al, Nucleic Acids 
Res.;27, 1492-1498: 1999; Weiler et al, Nucleic Acids Res:; 25, 2792- 
5 2799:1997). It is also possible to synthesize the oligonucleotide 

sequences of this invention with alternate nucleotide backbones such as 
phosphorothioate or phosphoroamidate nucleotides. Methods involving 
synthesis through the addition of blocks of sequence in a stepwise 
manner may also be employed (Lyttle et al, Biotechniques , 19: 274-280 
10 (1995) . Synthesis may be carried out directly on the substrate to be 

used as a solid phase support for the application or the oligonucleotide 
can be cleaved from the support for use in solution or coupling to a 
second support . 

15 Solid Phase Supports 

There are several different solid phase supports that can be used with 
the invention. They include but are not limited to slides, plates, chips,, 
membranes, beads, microparticles and the like. The solid phase supports can 
also vary in the materials that they are composed of including plastic, glass, 

20 silicon, nylon, polystyrene, silica gel, latex and the like. The surface of 

the support is coated with the complementary tag sequences by any conventional 
means of attachment. 

In preferred embodiments, the family of tag complement sequences is 
derivatized to allow binding to a solid support. Many methods of derivatizing 

25 a nucleic acid for binding to a solid support are known in the art (Hermanson 
G., Bioconjugate Techniques; Acad. Press: 1996). The sequence tag may be 
bound to a solid support through covalent or non-covalent bonds (Iannone et 
al, Cytometry; 39: 131-140, 2000; Matson et al, Anal. Biochem. ; 224: 110-106, 
1995; Proudnikov et al, Anal Biochem; 259: 34-41, 1998; Zammatteo et al, 

30 Analytical Biochemistry; 280:143-150, 2000). The sequence tag can be 

conveniently derivatized for binding to a solid support by incorporating 
modified nucleic acids in the terminal 5' or 3' locations. 

A variety of moieties useful for binding to a solid support (e.g., 
biotin, antibodies, and the like), and methods for attaching them to nucleic 

35 acids, are known in the art. For example, an amine -modified nucleic acid base 
(available from, eg., Glen Research) may be attached to a solid support (for 
example, Covalink-NH, a polystyrene surface grafted with secondary amino 
groups, available from Nunc) through a bifunctiorial crosslinker (e.g., 
bis (sulfosuccinimidyl suberate) , available from Pierce). Additional spacing 
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moieties can be added to reduce steric hindrance between the capture moiety 
and the surface of the solid support. 

Attaching Tags to Analytes for Sorting 

A family of oligonucleotide tag sequences can be conjugated to a 
population of analytes most preferably polynucleotide sequences in several 
different ways including but not limited to direct chemical synthesis, 
chemical coupling, ligation, amplification, and the like. Sequence tags that 
have been synthesized with primer sequences can be used for enzymatic 
extension of the primer on the target for example in PCR amplification. 

Detection of Single Nucleotide Polymorphisms Using Primer Extension 

There are a number of areas of genetic analysis where families of non- 
cross -hybridizing sequences can be applied including disease diagnosis, single 
nucleotide polymorphism analysis, genotyping, expression analysis and the 
like. One such approach for genetic analysis, referred to as the primer 
extension method (also known as Genetic Bit Analysis (Nikiforov et al, Nucleic 
Acids Res.; 22, 4167-4175: 1994; Head et al Nucleic Acids Res.; 25, 5065-5071: 
1997)), is an extremely accurate method for identification of the nucleotide 
located at a specific polymorphic site within genomic DNA. In standard primer 
extension reactions, a portion of genomic DNA containing a defined polymorphic 
site is amplified by PCR using primers that flank the polymorphic site. In 
order to identify which nucleotide is present at the polymorphic site, a third 
primer is synthesized such that the polymorphic position is located 
immediately 3' to the primer. A primer extension reaction is set up 
containing the amplified DNA, the primer for extension, up to 4 
dideoxynucleoside triphosphates (each labeled with a different fluorescent 
dye) and a DNA polymerase such as the Klenow subunit of DNA Polymerase 1. 
The use of dideoxy nucleotides ensures that a single base is added to the 3' 
end of the primer, a site corresponding to the polymorphic site. In this way 
the identity of the nucleotide present at a specific polymorphic site can be 
determined by the identity of the fluorescent dye-labeled nucleotide that is 
incorporated in each reaction. One major drawback to this approach is its low 
throughput. Each primer extension reaction is carried out independently in a 
separate tube. 

Universal sequences can be used to enhance the throughput of primer 
extension assay as follows. A region of genomic DNA containing multiple 
polymorphic sites is amplified by PCR. Alternatively, several genomic 
regions containing one or more polymorphic sites each are amplified 
together in a multiplexed PCR reaction. The primer extension reaction 
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is carried out as described above except that the primers used are 
chimeric, each containing a unique universal tag at the 5' end and the 
sequence for extension at the 3' end. In this way, each gene-specific 
sequence would be associated with a specific universal sequence. The 
5 chimeric primers would be hybridized to the amplified DNA and primer 
extension is carried out as described above. This would result in a 
mixed pool of extended primers, each with a specific fluorescent dye 
characteristic of the incorporated nucleotide. Following the primer 
extension reaction, the mixed extension reactions are hybridized to an 

10 array containing probes that are reverse complements of the universal 

sequences on the primers. This would segregate the products of a , number 
of primer extension reactions into discrete spots. The fluorescent dye 
present at each spot would then identify the nucleotide incorporated at 
each specific location. A number of additional methods for the 

15 detection of single nucleotide polymorphisms, including but not limited 
to, allele specific polymerase chain reaction (ASPCR) , allele specific 
primer extension (ASPE) and oligonucleotide ligation assay (OLA) can be 
performed by someone skilled in the art in combination with the tag 
sequences described herein. 

20 

Kits Using Families Of Tag Sequences 

The families of non cross -hybridizing sequences may be provided in kits 
for use in for example genetic analysis. Such kits include at least one set of 
non-cross -hybridizing sequences in solution or on a solid support. Preferably 
25 the sequences are attached to microparticles and are provided with buffers and 
reagents that are appropriate for the application. Reagents may include 
enzymes, nucleotides, fluorescent labels and the like that would be required 
for specific applications. Instructions for correct use of the kit for a given 
application will be provided. 

30 

EXAMPLES 

EXAMPLE 1 - Cross Talk Behavior of Sequence on Beads 

A group of 100 sequences, randomly selected from Table I, was tested for 
feasibility for use as a family of minimally cross -hybridizing 
35 oligonucleotides. The 100 sequences selected are separately indicated in 
Table I along with the numbers assigned to the sequences in the tests. 

The tests were conducted using the Luminex LabMAP™ platform available 
• from Luminex Corporation, Austin, Texas, U.S.A. The one hundred sequences, 
used as probes, were synthesized as oligonucleotides by Integrated DNA 
40 Technologies (IDT, Coralville, Iowa, U.S.A.). Each probe included a C 6 
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aminolink group coupled to the 5 '-end of the oligonucleotide through a C 12 
ethylene glycol spacer. The C 6 aminolink molecule is a six carbon spacer 
containing an amine group that can be used for attaching the oligonucleotide 
to a solid support. One hundred oligonucleotide targets (probe complements), 
5 the sequence of each being the reverse complement of the 100 probe sequences, 
were also synthesized by IDT. Each target was labelled at its 5' -end with 
biotin. All oligonucleotides were purified using standard desalting 
procedures, and were reconstituted to a concentration of approximately 200 |iM 
in sterile, distilled water for use. Oligonucleotide concentrations were 
10 determined spectrophotometrically using extinction coefficients provided by 
the supplier. 

Each probe was coupled by its amino linking group to a 
carboxylated fluorescent microsphere of the LapMAP system according to 
the Luminex 100 protocol. The microsphere, or bead, for each probe 

15 sequence has unique, or spectrally distinct, light absorption 

characteristics which permits each probe to be distinguished from the 
other probes. Stock bead pellets were dispersed by sonication and then 
vortexing. For each bead population, five million microspheres (4 00 uL) 
were removed from the stock tube using barrier tips and added to a 1.5 

20 mli Eppendorf tube (USA Scientific) . The microspheres were then 

centrifuged, the supernatant was removed, and beads were resuspended in 
25 fiL of 0.2 M MES (2- (N-morpholino) ethane sulfonic acid) (Sigma), pH 
4.5, followed by vortexing and sonication. One nmol of each probe (in a 
25 uL volume) was added to its corresponding bead population. A volume 

25 of. 2. 5 uL of EDC cross-linker ( 1-ethyl -3- (3 -dimethyl aminopropyl) 

carbodiimide hydrochloride (Pierce) , prepared immediately before use by 
adding 1.0 mL of sterile ddH 2 0 to 10 mg of EDC powder, was added to each 
microsphere population. Bead mixes were then incubated for 30 minutes 
at room temperature in the dark with periodic vortexing. A second 2.5 

3 0 \xL aliquot of freshly prepared EDC solution was then added followed by 
an additional 30 minute incubation in the dark. Following the second 
EDC incubation, 1.0 mL of 0.02% Tween-20 (BioShop) was added to each 
bead mix and vortexed. The microspheres were centrifuged, the 
supernatant was removed, and the beads were resuspended in 1.0 mL of 

35 0.1% sodium dodecyl sulfate (Sigma). The beads were centrifuged again 
and the supernatant removed. The coupled beads were resuspended in 100 
uL of 0.1 M MES pH 4.5. Bead concentrations were then determined by 
diluting each preparation 100 -fold in ddH 2 0 and enumerating using a 



WO 02/059355 



PCT/CA02/00089 



- 53 - 

Neubauer BrightLine Hemacytometer. Coupled beads were stored as 
individual populations at 8° 6 C protected from light. 

The relative oligonucleotide probe density on each bead population 
was assessed by Terminal Deoxynucleotidyl Transferase (TdT) end- 
5 labelling with biotin-ddUTPs . TdT was used to label the 3' -ends of 

single -stranded DNA with a labeled ddNTP. Briefly, 180 uli of the pool 
of 100 bead populations (equivalent to about 4000 of each bead' type) to 
be used for hybridizations was pipetted into an Eppendorf tube and 
centrifuged. The supernatant was removed, and the beads were washed in 

10 lx TdT buffer. The beads were then incubated with a labelling reaction 
mixture, which consisted of 5x TdT buffer, 25mM CoCl 2 , and 1000 pmol of 
biotin - 16 -ddUTP (all reagents were purchased from Roche) . The total 
reaction volume was brought up to 85.5 with sterile, distilled H 2 0, 
and the samples were incubated in the dark for 1 hour at 37 °C. A second 

15 aliquot of enzyme was added, followed by a second 1 hour incubation. 
Samples were run in duplicate, as was the negative control, which 
contained all components except the TdT. In order to remove 
unincorporated biotin-ddUTP, the beads were washed 3 times with 200 uL 
of hybridization buffer, and the beads were resuspended in 50 uL of 

20 hybridization buffer following the final wash. The biotin label was 

detected spectrophotometrically using SA-PE (streptavidin-phycoerythrin 
conjugate) . The streptavidin binds to biotin and the phycoerythrin is 
spectrally distinct from the probe beads. The lOmg/mL stock of SA-PE 
was diluted 100-fold in hybridization buffer, and 15 \xh of the diluted 

25 SA-PE was added directly to each reaction and incubated for 15 minutes 
at 37°Celsius. The reactions were analyzed on the Luminex 100 LabMAP . 
Acquisition parameters were set to measure 100 events per bead using a 
sample volume of 50 uL. 

The results obtained are shown in Figure 2 . As can be seen the 

30 Mean Fluorescent Intensity (MFI) of the beads varies from 840.3 to 

3834.9, a range of 4.56-fold. Assuming that the labelling reactions are 
complete for all of the oligonucleotides, this illustrates the signal 
intensity that would be obtained for each type of bead at this 
concentration if the target (i.e., labelled complement) was bound to the 

35 probe sequence to the full extent possible. 

The cross -hybridization of targets to probes was evaluated as 
follows. 100 oligonucleotide probes linked to 100 different bead 
populations, as described above, were combined to generate a master bead 
mix, enabling multiplexed reactions to be carried out. The pool of 
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microsphere -immobilized probes was then hybridized individually with 
each biotinylated target. Thus, each target was examined individually 
for its specific hybridization with its complementary bead- immobilized 
sequence, as well as for its non-specific hybridization with the other 
99 bead- immobilized universal sequences present in the reaction. For 
each hybridization reaction, 25 uL bead mix (containing about 2500 of" 
each bead population in hybridization buffer) was added to each well of 
a 96-well Thermowell PCR plate and equilibrated at 37 °C. Each target 
was diluted to a final concentration of 0.002 fmol/uL in hybridization 
buffer, and 25 uL (50 fmol) was added to each well, giving a final 
reaction volume of 50 uL. Hybridization buffer consisted of 0.2 M NaCl, 
0.1 M Tris, 0.08% Triton X-100, pH 8 . 0 and hybridizations were performed 
at 37°C for 30 minutes. Each target was analyzed in triplicate and six 
background samples (i.e. no target) were included in each plate. A SA- 
PE conjugate was used as a reporter, as described above. The 10 mg/mL 
stock of SA-PE was diluted 100-fold in hybridization buffer, and 15 uL 
of the diluted SA-PE was added directly to each reaction, without 
removal of unbound target, and incubated for 15 minutes at 37°C. 
Finally, an additional 35 uL of hybridization buffer was added to each 
well, resulting in a final volume of 100 uL per well prior to analysis 
on the Ziuminex 100 LabMAP. Acquisition parameters were set to measure 100 
events per bead using a sample volume of 80 uL. 

The percent hybridization was calculated for any event in which 
the NET MFI was at least 3 times the zero target background. In other 
words, a calculation was made for any sample where (MFI aaTnple -MFI zero target 

background) /MFI zero target background > 3. 

The net median fluorescent intensity (MFI Bample -MFI zer o target background) 
generated for all of the 10,000 possible target/probe combinations was 
calculated. As there are 100 probes and 100 targets, there are 100 x 100 
= 10,0000 possible different interactions possible of which 100 are the 
result of perfect hybridizations. The remaining 9900 result from 
hybridization of a target with a mismatched probe. A cross- 
hybridization event is then defined as a non-specific event whose net 
median fluorescent intensity exceeds 3 times the zero target background. 
In other words, a cross -talk calculation is only be made for any sample 

Where (MFI san5 ,i e -MFI zero target background) /MFI 2e ro target background > 3 - CroSS- 

hybridization events were quantified by expressing the value of the 
cross-hybridization signal as a percentage of the perfect match 
hybridization signal with the same -probe. 
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The results obtained are illustrated in Figure 3. The ability of 
each target to be specifically recognized by its matching probe is 
shown. Of the possible 9900 non-specific hybridization events that 
could have occurred when the 100 targets were each exposed to the pool 
of 100 probes, 6 events were observed. Of these 6 events, the highest 
non-specific event generated a signal equivalent to 5.3% of the signal 
observed for the perfectly matched pair (i.e. specific hybridization 
event) . 

Each of the 100 targets was thus examined individually for 
specific hybridization with its complement sequence as incorporated onto 
a microsphere, as well as for non-specific hybridization with the 
complements of the other 99 target sequences. Representative 
hybridization results for target (complement of probe 90, Table I) are 
shown in Figure 4. Probe 90 was found to hybridize only to its 
perfectly-matched target. No cross -hybridization with any of. the other 
99 targets was observed. 

The foregoing results demonstrate the possibility of incorporating 
the 1168 sequences of Table I, or any subset thereof, into a multiplexed 
system with the expectation that most if not all sequences can be 
distinguished from the others by hybridization. That is, it is possible 
to distinguish each target from the other targets by hybridization of 
the target with its precise complement and minimal hybridization with 
complements of the other targets. 

EXAMPLE 2 - Tag sequences used in sorting polynucleotides 

The family of non cross hybridizing sequence tags or a subset thereof 
can be attached to oligonucleotide probe sequences during synthesis and used 
to generate amplified probe sequences. In order to test the feasibility of 
PCR amplification with non cross hybridizing sequence tags and subsequently 
addressing each respective sequence to its appropriate location on two- 
dimensional or bead arrays, the following experiment was devised. A 24mer tag 
sequence can be connected in a 5' -3' specific manner to a p53exon specific 
sequence (20mer reverse primer) . The connecting p53 sequence represents the 
inverse complement of the nucleotide gene sequence. To facilitate the 
subsequent generation of single stranded DNA post-amplification the tag- 
Reverse primer can be synthesized with a phosphate modification (P0 4 ) on the 
5' -end. A second PCR primer can also be generated for each desired exon, 
represented by the Forward (5'~3') amplification primer. In this instance the 
Forward primer can be labeled with a 5'-biotin modification to allow detection 
with Cy3-avidin or equivalent. 



WO 02/059355 



PCT/CA02/00089 



- 56 - 

A practical example of the aforementioned description is as follows : 
For exon 1 of the human p53 tumor suppressor gene sequence the following tag 
Reverse primer (SEQ ID NO: 1171) can be generated: 

222087 222063 
5' - P04 -ATGTTAAAGTAAGTGTTGAAATGT - TCCAGGGAAGCGTGTCACCGTCGT - 3 ' 
Tag Sequence # 3 Exon 1 Reverse 

The numbering above the Exon-1 reverse primer represents the genomic 
nucleotide positions of the indicated bases. 

The corresponding Exon-1 Forward primer sequence (SEQ ID NO: 1172) is as 
follows : 

221873 221896 
5 ' -Biotin-TCATGGCGACTGTCCAGCTTTGTG-3 ' 

In combination these primers will amplify a product of 214 bp plus a 24 
bp tag extension yielding a total size of 238 bp. 

Once amplified, the PCR product can be purified using a QIAquick PGR 
purification kit and the resulting DNA can be quantified. To generate 
single stranded DNA, the DNA is subjected to A.-exonuclease digestion 
thereby resulting in the exposure of a single stranded sequence (anti- 
tag) complementary to the tag- sequence covalently attached to the solid 
phase array. The resulting product is heated to 95°C for 5 minutes and 
then directly applied to the array at a concentration of 10-50 nM. 
Following hybridization and concurrent sorting, the tag-Exon 1 sequences 
are visualized using Cy3-streptavidin. In addition to direct 
visualization of the biotinylated product, the product itself can now 
act as a substrate for further analysis of the amplified region, such as 
SNP detection and haplotype determination. 

DEFINITIONS 

Non-cross -hybridization: Describes the absence of hybridization between 
two sequences that are not perfect complements of each other. 

Cross -hybridization: The hydrogen bonding of a single-stranded DNA 
sequence that is partially but not entirely complementary to a single- 
stranded substrate. 



WO 02/059355 



PCT/CA02/00089 



- 57 - 

Homology or Similarity: How closely related two or more separate strands 
of DNA are to each other, based on their base sequences . 

Analogue: The symbols A, G, T/U, C take on their usual meaning in the 
• 5 art here. In the case of T and U, a person skilled in the art would 

understand that these are equivalent to each other with respect to the 
inter- strand hydrogen-bond (Watson-Crick) binding properties at work in 
the context of this invention. The two bases are thus interchangeable 
and hence the designation of T/U. A chemical, which resembles a 
10 nucleotide base is an analogue thereof. A base that does not normally 
appear in DNA but can substitute for the ones, which do, despite minor 
differences in structure. Analogues particularly useful in this 
invention are of the naturally occurring bases can be inserted in their 
respective places where desired. Such an analogue is any non-natural 
15 base, such as peptide nucleic acids and the like that undergoes normal 
Watson-Crick pairing in the same way as the naturally occurring 
nucleotide base to which it corresponds . 

Complement: The opposite or "mirror" image of a DNA sequence. A 
20 complementary DNA sequence has an "A" for every "T" and a "C" for every 
"G" . Two complementary strands of single stranded DNA, for example a tag 
sequence and its complement, will join to form a double -stranded 
molecule . 

25 Complementary DNA (cDNA) : DNA that is synthesized from a messenger RNA 

template; the single- stranded form is often used as a probe in physical 



Oligonucleotide: Refers to a short nucleotide polymer whereby the 
30 nucleotides may be natural nucleotide bases or analogues thereof. 

Tag: Refers to an oligonucleotide that can be used for specifically sorting 
analytes with at least one other oligonucleotide that when used together do 
not cross hybridize. 
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Table I 



Sequence 



SSQ ID NOt 



No. in 
Ex 1 



AAATTGTGAAAGATTGTTTGTGTA 
GTTAGAGTTAATTGTATTTGATGA 
ATGTTAAAGTAAGTGTTGAAATGT 
TGATGTTAGAAGTATATTGTGAAT 
TTTGTGTAGAATATGTGTTGTTAA 



1 
2 
3 
4 
5 



1 
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T 


T 


T G 


A 


G 


T 


A 


A 


T 


144 


- 


G 


A T G 


T A 


T 


A 


A 


T 


G 


A T G 


T 


G 


T G 


T 


A 


A 


A 


T 


T 


145 


- 


A 


T G T 


G A 


G 


A 


G 


A 


A 


G A A 


T 


T 


T G 


T 


T 


T 


A 


T 


T 


146 


- 


G 


T G A 


T A 


A 


A 


G 


T 


A 


T T G 


T 


T 


G A 


T 


A 


G 


A 


A 


A 


147 


- 


G 


A A G 


T A 


G 


A 


A 


T 


A 


G A A 


A 


G 


T T 


A. 


A 


T 


A 


G 


A 


148 


- 


T 


T G T 


G.T 


A 


G 


T 


T 


A 


A G A 


G 


T 


T G 


T 


T 


T 


A 


A 


T 


149 


24 


T 


A G T 


A G 


T 


A 


A 


G 


T 


T G T 


T 


A 


G A 


A 


T 


A 


G 


T 


T 


150 


- 


A 


ATT 


T G 


A 


A 


G 


T 


A 


T A A 


T 


G 


A A 


T 


G 


T 


G 


T 


G 


151 


- 


T 


A G A 


A A 


T 


T 


G 


T 


A G T A 


T 


T 


T G 


A 


G 


A 


G 


A 


A 


152 


- 


T 


G T A 


T A 


T 


G 


T 


T 


A A T G 


A 


G 


A T 


G 


T 


T 


G 


T 


A 


153 


25 


T 


ATT 


TGATAAGAGAATGAAGAAG 


T 


154 


26 


T 


T G A 


A T 


A 


G 


T 


G 


T 


A A T 


G 


A 


A T 


A 


T 


G 


A 


T 


G 


155 


- 


G 


TAG 


T T 


T 


G 


T 


G 


A 


ATA 


G 


A 


A T 


T 


A 


G 


T 


T 


T 


156 


- 


A 


A A G 


A T 


G 


A 


T 


T 


G 


T A A 


T 


T 


T G 


T 


G 


T 


G 


A 


A 


157 




G 


A A G 


A T 


T 


G 


T 


T 


G 


A G T 


T 


A A T 


A 


G 


A 


T 


A 


A 


158 


- 


A 


GAT 


T A 


T 


G 


T 


A 


G 


T G A 


T 


G 


T A 


A 


A 


T 


G 


T 


T 


159 


- 


G 


A A T 


T T 


A 


G 


A 


T 


G 


TAG 


A 


T 


A T 


G 


A 


A 


T 


G 


T 


160 


- 


G 


ATA 


G A 


A 


G 


T 


G 


T 


ATT 


A 


A 


G T 


A 


A 


G 


T 


T 


A 


161 


- 


T 


A T G 


A A 


T 


T 


A 


T 


G 


A G A 


A 


G 


A A 


T 


A 


G 


A 


G 


T 


162 


- 


T 


T T G 


T T 


A 


T 


G 


A 


A 


G T G 


A 


T 


T T 


G 


T 


T 


T 


G 


T 


163 


- 


G 


T A A 


A G 


A 


T 


T 


G 


T 


G T T 


A 


T 


A T 


G 


A 


A 


A 


T 


G 


164 


- 


T 


T G T 


G A 


T 


A 


G 


T 


A 


G T T 


A 


G 


A T 


A 


T 


T 


T 


G 


T 


165 


28 


G 


A A T 


T A 


A 


G 


A 


T 


A 


A A G 


A 


A 


G A 


G 


A 


A 


G 


T 


A 


166 


- 


G 


ATT 


G T 


A 


G 


A 


A 


T 


G A A 


T 


T 


T G 


T 


A 


G 


T 


A 


T 


167 




A 


A A T 


A A 


G 


A 


G 


A 


G 


A G A 


A 


T 


G A 


T 


T 


T 


A 


G 


T 


168 




A 


ATT 


A T 


G 


T 


G 


A 


A 


TAG 


A 


T 


T G 


T 


T 


G 


A 


A 


G 


169 




T 


T A A 


G A 


T 


T 


T 


A 


T 


G T{ G 


A 


T 


A G 


T 


A 


G 


A 


G 


T 


170 




T 


T A A 


A G 


A 


T 


A 


G 


T 


G T T 


T 


G 


TT 


G 


T 


G 


T 


T 


A 


171 




T 


ATT 


G A 


T 


T 


T 


A 


T 


G A A 


G A 


G T 


A 


T 


A 


G 


T 


G 


172 




A 


A A T 


T T 


G 


A 


T 


G 


A 


G T A 


G 


T 


T T 


A 


A 


G 


A- 


G 


A 


173 




A 


T A A 


A G 


T 


T 


G 


T 


T 


T G A 


T 


G 


T T 


T 


G 


A 


A 


T 


G 


174 




G 


ATT 


G T 


G 


A 


T 


G 


A 


ATA 


A 


T 


G T 


T 


A 


T 


T 


G 


A 


175 




G 


A T G 


A A G A A A 


T A T G 


A 


T 


A T 


G 


A A 


T 


A 


G 


176 





WO 02/059355 



PCT/CA02/00089 



- 61 - 



Table I 

Sequence SEQ ID NO: No. in 

Ex 1 



rp 
1 


T 


A 


A 


A 


G 


T 


T 


A 


T 


T 


G 


A 


A 


G 


T 


G 


A 


A 


G 


•T 


T 


G 


A 


177 




rp 


rp 
1 




rp 
1 


7V 
A 


TV 

A 




TV 

A 


TV 

A 


TV 
A 


rp 


TV 

A 


G 


TV 

A 




TV 

A 


T 


T 


T 


G 


T 


G 


rr> 
T 


rp 
1 


178 




Cjt 


7V 
A 


G 


A 


T 


T 


G 


TV 

A 


G 


T 


T 


T 


A 


A 


G 


T 


A 


T 


T 


A 


G 


A 


T 


T 


179 




A 

A 


r~i 


rp 


r~i 
<J 


A 


rp 


7V 
A 


TV 

A 


T 


A 


G 


A 


A 


T 


G 


A 


T 


A 


A 


A 


T 


G 


T 


G 


180 




r* 

K3 


TV 

A 


rp 
T 


A 


A 


T 


A 


G 


T 


G 


A 


A 


T 


T 


T 


G 


A 


G 


T 


T 


G 


T 


A 


T 


181 




TV 




A 


T 


A 


T 


T 


T 


G 


T 


A 


G 


T 


A 


G 


A 


A 


A 


G 


T 


A 


T 


G 


T 


182 




/-» 
Vj 


rp 
I 


rp 


A 


T 


G 


A 


TV 
A 


T 


G 


T 


T 


G 


A 


A 


T 


T 


T 


G 


A 


A 


T 


G 


T 


183 




A 


T 


G 


A 


A 


A 


G 


A 


T 


T 


T 


A 


G 


T 


T 


G 


T 


G 


A 


G 


A 


T 


A 


T 


184 


30 


A 


A 


A 


T 


A 


G 


A 


G 


A 


A 


G 


T 


T 


A 


T 


G 


A 


T 


G 


T 


G 


A 


T 


A 


185 




T 


T 


A 


G 


T 


G 


A 


G 


A 


A 


A 


T 


G 


T 


T 


T 


A 


A 


T 


G 


T 


G 


A 


T 


186 




rp 

JL 


Lj 


A 


A 


G 


A 


A 


T 


A 


T 


G 


T 


G 


A 


A 


A 


T 


T 


A 


G 


T 


T 


T 


G 


187 




rt 

Cj 


T 


T 


T 


G 


A 


T 


A 


G 


T 


T 


T 


A 


A 


T 


G 


A 


G 


T 


A 


T 


T 


G 


A 


188 




G 


T 


T 


G 


T 


A 


A 


G 


T 


A 


A 


T 


G 


A 


T 


A 


A 


A 


G 


T 


A 


T 


G 


A 


189 


■* 


T 


A 


A 


G 


A 


G 


T 


A 


G 


T 


A 


A 


T 


T 


G 


T 


T 


G 


T 


T 


T 


A 


G 


A 


190 




T 


T 


T 


G 


A 


G 


A 


G 


A 


G 


T 


A 


T 


G 


T 


A 


T 


G 


A 


T 


T 


A 


T 


T 


191 




TV 

A 


T 


T 


G 


A 


T 


T 


G 


T 


G 


A 


A 


T 


T 


A 


G 


A 


T 


A 


G 


A 


A 


G 


A 


192 




G 


A 


T 


T 


A 


G 


T 


A 


T 


T 


T 


A 


G 


T 


A 


G 


T 


A 


A 


T 


A 


G 


A 


G 


193 


31 


T 


A 


T 


G 


T 


A 


T 


T 


A 


G 


A 


G 


A 


T 


A 


T 


T 


G 


A 


A 


A 


G 


T 


G 


194 


— 


T 


A 


T 


G 


T 


G 


A 


A 


A 


G 


T 


A 


A 


T 


G 


A 


T 


A 


A 


A 


.T 


G 


A 


G 


195 


** 


G 


T 


A 


A 


T 


T 


A 


G 


T 


A 


A 


T 


G 


A 


T 


T 


T 


G 


A 


A 


T 


G 


A 


G 


196 


— 


G 


T 


T 


T 


A 


T 


T 


G 


T 


A 


A 


A 


G 


A 


T 


G 


T 


A 


A 


G 


T 


G 


A 


A 


197 




T 


A 


G 


T 


A 


G 


A 


A 


T 


T 


G 


T 


T 


G 


T 


T 


A 


A 


A 


G 


A 


A 


T 


G 


198 


32 


T 


A 


T 


T 


G 


T 


T 


A 


G 


T 


T 


A 


T 


G 


T 


A 


G 


T 


G 


T 


G 


T 


A 


A 


199 


— 


G 


A 


G 


T 


G 


A 


A 


A 


G 


T 


T 


A 


T 


A 


T 


G 


A 


A 


A 


G 


T 


A 


T 


A 


200 




A 


T 


A 


T 


A 


G 


A 


A 


G 


T 


T 


G 


A 


T 


G 


A 


G 


T 


T 


T 


A 


T 


G 


A 


201 




T 


T 


T 


A 


G 


A 


A 


G 


T 


A 


A 


G 


A 


A 


T 


A 


A 


G 


T 


G 


A 


G 


T 


A 


202 




T 


G 


T 


G 


T 


A 


T 


A 


A 


G 


A 


T 


A 


T 


T 


T 


G 


T 


A 


A 


G 


A 


A 


G 


203 


— 


T 


A 


G 


A 


A 


G 


A 


G 


T 


T 


G 


T 


A 


T 


T 


G 


T 


T 


A 


T 


A 


A 


G 


T 


204 




G 


T 


G 


T 


T 


A 


T 


T 


A 


G 


T 


T 


T 


A 


A 


G 


T 


T 


A 


G 


A 


G 


T 


A 


205 




A 


A 


T 


A 


T 


A 


G 


T 


G 


A 


T 


G 


T 


G 


A 


A 


A 


T 


T 


G 


A 


A 


T 


G 


206 




T 


T 


A 


G 


A 


G 


A 


A 


T 


A 


G 


A 


G 


T 


G 


A 


T 


T 


A 


T 


A 


G 


T 


T 


207 




G 


A 


A 


G 


T 


G 


A 


G 


T 


T 


A 


A 


T 


G 


A 


T 


T 


T 


G 


T 


A 


A 


A 


T 


208 




A 


A 


T 


G 


T 


A 


A 


A 


G 


T 


A 


A 


A 


G 


A 


A 


A 


G 


T 


G 


A 


T 


G 


A 


209 


33 


G 


T 


T 


A 


G 


T 


T 


A 


T 


G 


A 


T 


G 


A 


A 


T 


A 


T 


T 


G 


T 


G 


T 


A 


210 


34 


A 


A 


A 


T 


G 


A 


G 


T 


T. 


A 


G 


A 


G 


T 


A 


G 


A 


A 


T 


T 


A 


T 


G 


T 


211 




G 


A 


T 


A 


T 


A' 


G 


A 


A 


G 


A 


T 


T 


A 


G 


T 


T 


A 


G 


T 


G 


A 


T 


A 


212 




TV 

A 


T 


A 


G 


T 


T 


T 


G 


T 


T 


G 


A 


G 


A 


T 


T 


T 


A 


T 


G 


A 


G 


T 


A 


213 




T 


A 


G 


A 


A 


T 


A 


G 


T 


T 


A 


G 


T 


A 


G 


T 


A 


A 


G 


A 


G 


T 


A 


T 


214 






A 


A 


T 


T 


T 


G 


T 


A 


T 


T 


G 


T 


G 


A 


A 


G 


T 


T 


T 


A 


G 


T 


A 


215 




/-i 
(j 


T 


A 


G 


T 


A 


A 


G 


A 


A 


G 


A 


G 


A 


A 


T 


T 


A 


G 


A 


T 


T 


A 


A 


216 




A 


TV 

A 


T 


tar 


T 


Gr 


T 


T 


A 


T 


G 


T 


A 


T 


G 


T 


A 


A 


A 


T 


A 


G 


T 


G 


217 




rt 


7V 
A 


A 


T 


T 


A 


G 


T 


T 


A 


G 


A 


G 


T 


A 


A 


A 


T 


T 


G 


T 


T 


T 


G 


218 




G 


A 


A 


A 


T 


T 


G 


A 


A 


G 


A 


T 


A 


G 


T 


A 


A 


G 


A 


A 


A 


T 


G 


A 


219 




c* 
\j 


rp 

T 


r* 
(j 


rp 

T 


TV 

A 


rp 

T 


m 

T 


A 


T 


G 


T 


G 


A 


T 


T 


T 


A 


T 


G 


A 


T 


A 


G 


TV 
A 


220 




rp 


TV 

A 


rrf 

T 


T 


A 


T 


G 


A 


G 


A 


A 


A 


G 


T 


T 


G 


A 


A 


T 


A 


G 


T 


A 


G 


221 


35 


T 


A 


T 


G 


T 


A 


T 


T 


G 


T 


A 


T 


T 


G 


A 


G 


T 


A 


G 


A 


T 


G 


A 


A 


222 




G 


T 


G 


A 


T 


T 


G 


A 


A 


T 


A 


G 


T 


A 


G 


A 


T 


T 


G 


T 


T 


T 


A 


A 


223 


36 


A 


G 


T 


A 


A 


G 


T 


T 


G 


T 


T 


T 


G 


A 


T 


T 


G 


A 


A 


A 


T 


T 


T 


G 


224 




G 


A 


A 


G 


T 


T 


T 


G 


A 


T 


T 


T 


A 


A 


G 


T 


T 


T 


A 


A 


G 


A 


A 


G 


225 




G 


A 


G 


A 


A 


G 


A 


T 


A 


A 


A 


T 


G 


A 


T 


A 


T 


T 


G 


T 


T 


A 


T 


G 


226 




A 


T 


G 


A 


T 


G 


A 


G 


T 


T 


G 


T 


T 


A 


A 


T 


A 


G 


T 


T 


A 


G 


T 


T 


227 




T 


A 


T 


G 


A 


T 


A 


T 


T 


T 


G 


A 


A 


G 


A 


G 


T 


G 


T 


T 


A 


A 


G 


A 


228 




G 


A 


G 


A 


T 


G 


A 


T 


T 


A 


A 


A 


G 


T 


G 


A 


T 


T 


T 


A 


T 


G 


A 


A 


229 




A 


T 


A 


G 


T 


T 


A 


A 


G 


A 


G 


T 


G 


A 


T 


G 


A 


G 


A 


A 


T 


A 


A 


A 


230 




T 


T 


T 


A 


T 


T 


G 


T 


T 


A 


G 


A 


T 


A 


A 


A 


G 


A 


G 


T 


T 


G 


A 


G 


231 




A 


G 


A 


A 


T 


A 


T 


T 


G 


A 


T 


A 


G 


T 


T 


G 


A 


A 


G 


T 


T 


G 


A 


A 


232 




T 


A 


G 


T 


G 


T 


A 


A 


A 


G 


T 


G 


T 


A 


G 


A 


T 


T 


G 


T 


A 


A 


A 


T 


233 





WO 02/059355 



PCT/CA02/00089 



- 62 - 
Table I 













Sequence 
























xu WO : 


No • i; 
Ex 1 


A *G T 




Q 


X 


G 


A T 


A T 1 n A 


rp 
1 


rp 

X 


i 


G 


A 


A 


rp 

T 


A 


rp 

T 


T 


G 


234 




T G T 




T 


T 


G 


A A 


t 1 t a. r* 
x l A \j 


A 


A 

A 


rp 
X 


TV 

A 


G 


1 


r~i 

g 


A 


G 


A 


A 


235 




T G A 


T 


a 
*\ 


X 


G 


A G 


A T> A r* 
A X A G 


A 

A 


A 


G 


rp 

T 


rp 

T 


rp 

T 


A 


A 


T 


G 


m 

T 


236 




G A A 






A 


G 


T A 


A n A 
A G 1 A 


m 
1 


A 

A 


A 
A 


A 

A 


G 


rp 
X 


TV 

A 


A 


A 


rp 
T 


ft 

G 


237 




T T T 

XXX 


A 
■t\ 


A 
±\ 


n 


T 


G T 


/"* A f| TV 

g a r A 


A 


G 


A 


A 


A 


G 


A 


T 


A 


G 


A 


238 




T T 

X .f-l x 


T 

X 




X 


T 


G A 


A *T* f~* rn 
A 1 G T 


G 


rn 
1 


T 


T 


A 


A 


A 


G 


A 


G 


A 


239 


38 


G A A 


T 


A 


A 


T 


G A 


T G A G 


A 


T 


G 


A 


T 


T 


A 


T 


T 


G 


A 


240 


~* 


TAG 


A 


G 


A 


A 


A G 


A G A G 


A 


A 


T 


T 


G 


T 


A 


T 


T 


A 


A 


241 


39 


A T G 


T 


A 


T 


A A T 


GAGA 


T 


A 


T 


G 


T 


T 


T 


G 


T 


G 


A 


242 


- 


A A T 


A 


G 


A 


T 


A A 


G A T T 


G 


A 


T 


T 


G 


T 


G 


T 


T 


T 


G 


243 


40 


T T T 


G 


A 


T 


G 


A T 


A A T A 


G 


A 


A 


G 


A 


G 


A 


A 


T 


G 


A 


244 




A G A 


T 


G 


A 


A 


T A 


A G T T 


G 


T 


G 


A 


A 


T 


G 


T 


T 


T 


A 


245 


— 


A G A 


T 


G 


A 


A 


A G 


A A A G 


T 


G 


T 


A 


G 


A 


A 


T 


A 


T 


T 


246 


— 


T G T 


T 


A 


A 


A 


T G 


T A T G 


T 


A 


G 


T 


A 


A 


T 


T 


G 


A 


G 


247 


41 


TAG 


T 


A 


G 


T 


G T 


G A A G 


T 


T 


A 


T 


T 


T 


G 


T 


T 


A 


T 


248 




AGTGAATGT 


T T G T 


A 


A 


A 


G 


A 


G 


T 


T 


T 


A 


A 


249 




GAT 


A 


A 


A 


T 


G A 


G A A T 


T 


G 


A 


G 


T 


A 


A 


T 


T 


G 


T 


250 


— 


T G A 


T 


G 


A 


G 


A A 


A T T G 


T 


T 


T 


A 


A 


G 


T 


G 


T 


T 


T 


251 


— 


AAA 


T 


A 


A 


G 


T A 


G T G T 


G 


A 


G 


T 


A 


A 


T 


A 


G 


T 


A 


252 




TAT 


G 


A 


A 


A 


T A 


T G T G 


A 


T 


A 


G 


T 


A 


A 


G 


A 


G 
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253 


— 


ATT 


G 


T 


A 


A 


G A 


G T G A 


T 


T 


A 


T 


A 


G 


A 


T 


G 


A 


T 


254 


— 


A G A 


G 


T 


A 


A 


G A 


A T G A 


A 


A 


G 


A 


G 


A 


T 


A 


A 


T 


A 


255 


— 


T A A 


G 


T 


A 


A 


G T 


A G A T 


G 


T 


T 


A 


A A 


G 


A 


G 


A 


T 


256 




AAA 


T 


A 


G 


A 


A A 


G A A T 


T 


G 


T 


A 


G 


A 


G 


T 


A 


G 


T 


257 


— 


ATA 


G 


A 


T 


T 


T A 


A G T G 


A 


A 


G 


A 


G 


A 


G 


T 


T 


A 


T 


258 


42 


G A A 


T 


G 


T 


T 


T G 


T A A A 


T 


G 


T 


A 


T 


A 


G 


A 


T 


A 


G 


259 


43 


AAA 


T 


A 


G 


A 


A T 


G A G T 


A 


G 


T 


G 


A A 


A 


T 


A 


T 


G 


260 




T T G 


A A 


T 


T 


A T 


G T A G 


A 


G 


A 


A A 


G 


T 


A 


A 


A 


G 


261 
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T 


A A 


A 


T T 


GAGA 


G 


T 


A 


G 


T 


T 


G 


A 


A 


T 


T 


262 


~ 


m p m 
X G 1 


A 


A A 


G 


T G 


m m m » 

T T T A 


T 


A 


G 


T 


G 


T 


G 


T 


A 


A 


T 


263 
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T 


G 


A 


T 


T T 


GAGA 


T 


G 


A 


G 


A 


A 


T 


G 


T 


A 


A 


264 




A A T 


A 


T 


T 


G 


A T 


7\ m m 

A T G T 


G 


T 


T 


G 


T 


G 


A 


A 


G 


T 


A 


265 


— 


A G T 


G 


A 


G 


A 


T T 


A T G A 


G 


T 


A 


T 


T 


G 


A 


T 


T 


T 


A 


266 


44 


T T G 


T 


A 


T 


T 


T A 


G A T A 


G 


T 


G 


A 


G 


A 


T 


T 


A 


T 


G 


267 


— 


ATA 


G 


AAA 


T G 


A A A G 


A 


T 


A 


G 


A 


T 


A 


G 


A 


A 


G 


268 


- 


GAT 


T 


G 


T 


A 


T A 


T G T A 


A 


A 


G 


T 


A 


G 


T 


T 


T 


A 


G 


269 




TAT 


G 


A 


A 


T 


G T 


T A T T 


G 


T 


G 


T 


G 


T 


T 


G 


A 


T 


T 


270 


45 


GAT 


A 


T 


T 


A 


G T 


A G A G 


T 


A 


A 


G 


T 


A 


T 


A 


T 


T 


G 


271 


- 


T G A 


G 


A 


T 


G 


A A 


T T T G 


T 


G 


T 


T 


A 


T 


G 


A 


T 


A 


T 


272 




TAT 


G 


A 


A 


T 


G A 


A G T A 


A 


A 


G 


A 


G 


A 


T 


G 


T 


A 


A 


273 


— 


GAG 


T 


G 


A 


A 


T T 


T G T T 


G 


T 


A 


A 


T 


T 


T 


G 


T 


T 


T 


274 


— 


A G A 


A 


A 


T 


T 


G T 


A G A G 


T 


T 


A 


A 


T 


T 


G 


T 


G 


T 


A 


275 


_ 


G T G 


T 


T 


A 


A 


T G 


A A A ft 

A A A G 


T 


T 


G 


T 


G 


A 


A 


T 


A 


A 


T 


276 


** 


T G T 


G 


A 


T 


T 


T G 


T T A A 


G 


A 


A 


G 


A 


T 


T 


A 


A 


T 


G 


277 




A G T 


A 


G 


T 


A 


T T 


G T A A 


A 


G 


T 


A 


T 


A 


A 


A 


G 


A 


G 


278 


— 


T G A 


T 


T 


G 


T 


T G 


T 7A T 1 A 
1 A X A 


G 


T 


T 


A 


T 


T 


G 


T 


G 


T 


A 


279 




G' A T 


T 


G 


T 


A 


G T 


T T A A 


rp 

T 


G 


T 


T 


A 


A 


G 


A 


A 


T 


G 


280 


— 


A T G 


A 


A 


A 


T 


A A 


G A A A 


X 


rp 

X 


GAG 


T 


TV 

A 


G 


TV 

A 


G 


TV 

A 


281 




TAT 


G A 


T 


G 


A T 


A T T T 


G 


T 


T 


G- T 


A 


T 


G 


T 


G 


T 


282 




T T T 


A 


G 


A 


G 


T T 


T G A T 


T 


A 


G 


T 


A 


T 


G 


T 


T 


T 


G 


283 




A A T 


A 


A 


G 


A 


G A 


T T G T 


G 


A 


T 


G 


A 


G 


A 


A 


A 


T 


A 


284 




A A T 


G 


A 


A 


T 


A G 


A A T A 


G 


A 


G 


A 


A 


T 


G 


T 


A 


G 


A 


285 




G T A 


G 


T 


A 


G 


T A 


A T T T 


G 


A 


A 


T 


G 


T 


T 


T 


G 


A 


A 


286 


47 


A G T 


G 


A 


G 


T 


A A 


T T G A 


T 


T 


G 


A 


T 


T 


G 


T 


T 


A 


A 


287 




G A A 


T 


A A 


T 


G T 


T T A G 


T 


G 


T 


G 


T 


T 


T 


G 


A 


A 


A 


288 




ATA 


T 


G 


A A 


A G 


T A G A 


G 


A 


A 


A 


G 


T 


G 


T 


T 


A 


T 


289 




T G A 


G 


T 


T 


A 


T T 


G T A T 


T 


T 


A 


G 


T 


T 


T 


G 


A 


A 


G 


290 
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Table I 

Sequence SEQ ID NO: No. in 

Ex 1 



X 


A 
A 


P 


rp rp 
X X 


p 

G 


A 

A 


p 

G 


rp 

X 


rp 
X 


rp 
X 


A 

A 


A 

A 


A 


G 


rp 
X 


rp 
X 


G 


A A 


A 

A 


G 


A 

A 


A 

A 


Z9 1 




rp 
1 


A 

A 


A 

A 


A G 


A 

A 


G 


rp 
X 


p 
G 


A 
A 


rp 
X 


G 


rp 
X 


A A 


A 

A 


rp 


A 

A 


G 


A 


A 

A 


rt 

G 


rp 
X 


rp 
X 


292 




rp 
X 




rp 
X 


A G 


rp 
X 


p 
G 


rp 
X 


rp 
X 


rp 
X 


A 

A 


G 


A 

A 


G 


T 


A 

A 


A 

A 


G 


T 


T 


A 

A 


rp 
X 


rp 

r 


A 

A 


O Q O 

293 




a 
i-i. 


p 


A 
A 


Vj A 


rp 

X 


rp 
X 


A 
A 


A 

A 


rp 
X 


P 

G 


rp 
X 


G 


T 


T 


p 
G 


A 

A 


A 

A 


A 


G 


A 

A 


rp 
X 


rp 
X 


A 

A 


294 




r* 
va 


rp 


A 
A 


A T 1 

A 1 


A 


TV 

A 


G 


rp 
X 


rp 
X 


G 


rp 
X 


G 


A 


A 


A 


rt 

G 


A 


A 


G 


A 


T 


T 


A 


295 






A 
A 


p 


A rp 
A X 


G 


rp 

X 


rp 
X 


A 


rp 
X 


A 


G 


A 


T 


A 


A 


T 


G 


A 


A 


A 


G 


A 


A 


296 




X 


rp 


rp 
1 


A r* 

A Vj 


rp 
X 


rp 
X 


G 


A 

A 


rp 

X 


rp 
X 


G 


rp 


T 


G 


A 

A 


A 

A 


T 


A 


G 


A 


G 


T 


A 


297 




A 

A 


rp 
1 


rp 


A rp 
A X 


rp 
X 


G 


A 

A 


A 

A 


A 

A 


G 


rp 
X 


A 
A 


G 


A 


rp 


G 


T 


T 


A 


G 


A 


T 


G 


298 




1 


rp 


X 


A T 1 
A X 


G 


rp 
X 


G 


rp 
X 


G 


A 


rp 
X 


rp 
X 


G 


A 


G 


rp 

T 


G 


T 


T 


T 


A 


A 


T 


299 




1 


A 
A 


rp 
X 


rp rp 

1 T 


A 


G 


rp 
1 


rp 

X 


A 


G 


A 


T 


A 


G 


A 


T 


A 


G 


A 


G 


A 


G 


T 


300 




A 

A 


rp 
1 


G 


T G 


rp 

X 


rp 
X 


rp 

X 


A 

A 


rp 
X 


G 


rp 

T 


G 


A 


A 


A 

A 


rt 

G 


A 


T 


T 


T 


G 


T 


A 


301 




A 


rp 


A 


G T 


A 


A 


T 


T 


A 

A 


G 


A 


A 


G 


A 


G 


A 


A 


G 


A 


A 


T 


G 


T 


302 




rp 
X 


A 

A 


T 


G A 


G 


T 


G 


A 


T 


T 


A 


G 


A 


A 


T 


T 


G 


T 


A 


T 


T 


T 


G 


303 




rp 
1 


rp 
1 


A 


A T 


G 


T 


A 


T 


T 


G 


T 


T 


T 


A 


A 


A 


G 


A 


G 


T 


G 


T 


G 


304 






rp 
T 


A 


G A 


G 


A 


A 


T 


T 


A 


A 


G 


A A 


T 


T 


G 


T 


T 


T 


G 


A 


G 


305 


— 


r-"l 

G 


rn 
T 


T 


A T 


A 


A 


G 


T 


A 


G 


A 


A 


A 


T 


G 


T 


A 


T 


A 


G 


A 


A 


G 


306 




A 


G 


T 


A A 


T 


T 


A 


G 


T 


T 


T 


G 


A 


A 


A 


T 


G 


T 


G 


T 


A 


G 


T 


307 


- 


G 


A 


A 


A G 


A 


T 


T 


A 


T 


G 


A 


T 


T 


G 


T 


A 


A 


A 


G 


T 


G 


A 


T 


308 




G 


T 


A 


A G 


A 


T 


T 


A 


G 


A 


A 


. G 


T 


T 


A 


A 


T 


G 


A 


A 


G 


A 


A 


309 


48 


G 


A 


G 


A A 


T 


G 


T 


T 


G 


A 


A 


T 


A 


A 


G 


A 


A 


G 


T 


A 


A 


T 


T 
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m 


m 

T 
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A 


G 


T 


G 


T 


T 


T 


G 


A 


A 


T 


A 


G 


T 


G 


T 


T 


T 


A 


311 




7\ 

A 


T 


AAA 


G 


A 


A 


A 


G 


A 


G 


T 


A 


T 


G 


A 


G 


A 


T 


T 


A 


T 


G 


312 


— 


A 

A 


G 


T 


T A 


T 


T 


G 


A 


T 


T 


G 


A 


A 


G 


A 


T 


G 


A 


G 


A 


A 


A 


T 


313 




G 


rp 

1 


T 


T G 


T 


G 


T 


T 


T 


G 


T 


A 


T 


A 


A 


G 


T 


T 


G 


T 


T 


A 


A 


314 


50 


rp 
1 


T 


G 


T A 


T 


G 


T 


G 


A 


G 


T 


T 


T 


A 


G 


A 


T 


T 


A 


A 


T 


G 


A 


315 


— 


rp 
I 


A 


G 


T T 


A 


A 


A 


G 


T 


A 


T 


A 


G 


T 


T 


G 


T 


T 


T 


G 


A 


G 


T 


316 




A 

A 


A 

A 


A 


T T 


rp 
X 


G 


T 


G 


T 


T 


G 


A 


G 


A 


T 


T 


T 


G 


T 


A 


T 


A 


G 


317 




rp 


A 


T 


T A 


G 


T 


G 


T 


T 


A 


T 


G 


A 


T 


A 


A 


A 


G 


A 


G 


A 


A 


G 


318 




rp 


A 


T 


A A 


G 


A 


A 


G 


T 


A 


A 


T 


T 


T 


G 


A 


G 


A 


A 


G 


A 


G 


T 


319 




rp 
i 


A 

A 


A 


G T 


T 


G 


A 


G 


A 


T 


G 


T 


T 


T 


G 


T 


T 


T 


G 


A 


T 


A 


A 


320 




p 
G 


rp 
1 


G 


T A 


G 


A 


T 


T 


T 


A 


T 


G 


A 


A 


T 


T 


G 


A 


G 


T 


A 


A 


T 


321 




rp 

JL 


A 

A 


T 


A G 


A 

A 


G 


A 


A 


G 


rp 

r 


G 


rp 

T 


T 


T 


A 


rt 

G 


T 


T 


G 


T 


A 


T 


A 


3.22 




A 

A 


rp 


A 


A A 


G 


A A 


G 


A 


A 


T 


A 


G 


T 


T 


G 


T 


T 


G 


T 


G 


T 


A 


323 




A 

A 


p 

G 


A 


T T 


G 


A A 


A 

A 


rp 

T 


A 

A 


G 


A 

A 


T 


T 


A 


G 


AAA 


G 


T 


T 


G 


324 




p 

G 


rp 
1 


T 


G T 


r 


A 


T 


A 


A 


G 


A 


A 


A 


T 


A 


G 


T 


T 


T 


G 


T 


T 


G 


325 




A 


r* 


A 


A A 


rp 
X 


A 


G 


A 


G 


T 


A 


A 


G 


A 


G 


T 


G 


T 


T 


T 


A 


A 


A 


326 




A 
A 


P 


A 


G A 


rp 
X 


A 


G 


rp 
X 


A 


G 


X 


A 

A 


A 


A 


T 


A 


G 


T 


T 


A 


T 


T 


G 


327 




A 


A 

A 


A 


T G 


A 


T 


T 


G 


rp 
1 


G 


T 


A 


A 


G 


T 


T 


A 


T 


G 


T 


A 


T 


G 


328 




A 
A 


A 

A 


G 


A A 


G 


T 


A 


A 

A 


G 


A 

A 


G 


A 

A 


G 


A 


A 


A 


T 


T 


T 


G 


A 


A 


T 


329 




n 




G 


T G 


rp 
X 


A 


T 


rp 
X 


rp 

X 


A 

A 


G 


rp 
X 


T 


G 


A 

A 


rp 
X 


A A 


T 


T 


G 


A 


T 


330 




A 

A 


rp 
X 


T 


G T 


rp 
X 


G 


T 


rp 
X 


G 


rp 
X 


rp 
X 


rt 
G 


A 


G 


A 

A 


A 

A 


A 


T 


G 


T 


A 


T 


T 


331 




A 

.M 


p 
Is? 


A 


T A 


A 
A 


G 


T 


rp 
X 


A 
A 


A 
A 


A 
A 


p 

G 


T 


A 


A 

A 


A 

A 


G 


A 


G 


A 

A 


A 

A 


rp 

T 


G 


•3 *3 1 

332 




T 


A 
A 


G 


T T 


p 
G 


A 


A 


p 

G 


rp 


rp 
X 


A 
A 


G 


T 


T 


rp 

T 


A 

A 


A 


G 


T 


G 


T 


T 


A 


333 






P 
G 


T 


A A 


p 
G 


A 


A 


T 


rt 

G 


rp 
X 


A 

A 


A 


T 


A 


T 


G 


A 


T 


G 


A 


T 


A 


G 


334 




A 
A 


rp 


G 


A G 


A 


T 


T 


G 


A 


A 


A 


G 


A 


T 


T 


T 


A 


T 


G 


A 


A 


T 


G 


335 




rp 


G 


A 


rp rp 


G 


A 


A 


T 


T 


A 


G 


A 


G 


A 


G 


A 


A 'T 


G 


T 


A 


T 


A 


336 




A 


G 


T 


T A 


G 


T 


A 


A 


G 


A 


G 


A 


A 


T 


A 


T 


A 


G 


T 


G 


A 


A 


T 


337 




A 


T 


T 


A A 


G 


A 


T 


T 


G 


T 


A 


T 


A 


G 


T 


T 


A 


G 


T 


G 


A 


T 


G 


338 




G 


A 


G 


A T 


A 


A 


A 


G 


A 


A 


T 


T 


G 


A 


A 


A 


T 


A 


G 


A 


A 


G 


A 


339 




A 


G 


A 


G T 


A 


A 


A 


T 


G 


T 


T 


A 


A 


G 


A 


A 


A 


G 


A 


A 


G 


T 


T 


340 




A 


A 


A 


G T 


T 


T 


G 


T 


T 


A 


T 


G 


T 


G 


T 


G 


A 


A 


G 


A 


A 


T 


T 


341 




A 


T 


T 


G T 


G 


T 


T 


T 


A 


A 


G 


A 


A 


A 


T 


A 


T 


G 


A 


T 


G 


A 


G 


342 




T 


A 


T 


T G 


A 


A 


A 


T 


G 


A 


G 


A 


T 


G 


T 


A 


T 


G 


T 


A 


G 


T 


T 


343 




A 


T 


T 


T G 


T 


G 


T 


G 


A 


T 


G 


T 


T 


T 


G 


A 


A 


A 


T 


A 


T 


G 


A 


344 




T 


A 


A 


G A 


T 


A 


A 


T 


A 


G 


T 


G 


A 


G 


A 


G 


A 


A 


A 


T 


T 


G 


A 


345 




A 


T 


T 


T A 


T 


G 


A 


T 


T 


A 


G 


T 


G 


T 


A 


A 


G 


T 


G 


T 


T 


G 


T 


346 




G 


A 


T 


T A 


A 


G 


A 


A 


T 


A 


A 


A 


G 


T 


G 


T 


G 


A A 


G 


A 


A 


T 


347 
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Table I 

Sequence SEQ ID NO: No. in 

Ex 1 



Q 


T 


A 


A 


X 


T 

X 


VJ 


A 


rp 
X 


r* 

VJ 


A 


A 


VJ 


A 
A 


G 


rp 
X 


T 


A 


r-i 
VJ 


rp 
1 


rp 
1 


rp A 
1 A 


rp 


34 8 




T 


Q 


T 


G 


T 


X 




T 


ri 

VJ 


rp 
X 


T 


A 


rp 
X 


A 
A 


A 
A 


r* 

Vj 


A A 


/-i 
VJ 


rp 
X 


G 


A X 


A 

.A 


343 






V? 




G 


A 


A 


A 

rt. 


T 

X 


X 


r» 

VJ 


A A 


rp 
X 


rp 
X 


rp 
X 


A 
A 


G 


A 


A 

A 


A 


rp 


r* rp 

G I 


G 


"3 C rt 

350 




T 


T 




T 


T 


rz 


A 


A 
r\ 


T 

X 


VJ 


T 


G 


A 
A 


r* 

VJ 


A 
A 


A 

A 


A 


G 


rp 
X 


A 

A 


rp 
X 


rp rp 


G 


351 




T 


ft 
vj 


T 


T 


A 


A 


X 


VJ 


A 
A 


VJ 


A 


A 


VJ 


A 

A 


rp 
X 


A 

A 


A 
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ATA 


G 


A 


T 


G 


T 


T 
1 


T 
1 


A 


G 


rp 
1 


rp 
1 


rt 
la 


A 
A 


r* 
la 


A 
A 


rp 
X 


rp 
X 


A 

A 


rp 
X 


T 


G 


437 




A A G 


A 


G 


T 


G 


T 


A 

A 


A 
A 


A 


T 


A 

A 


la 


A 
A 


A 

A 


A 

A 


la 


rp 
X 


r» 
la 


A 

A 


rp 
X 


A 


T 


438 




T G T 


G 


T 


A 


T 


T 


ri 

la 


A 

A 


T 


T 


r» 

la 


rp 


rp 


rt 
lj 


A 


/-i 
lj 


A 


T 


A 


A 


A 


T 


439 




TAGTATAG 


T 
1 


G 


A 


G 


A 
A 


A 

A 


A 
A 


la 


A 


la 


rp 
X 


rp 
X 


A 


A 


A 


T 


440 




AAA 


G 


A 


T 


A 


A 


a 

va 


A 
A 


A 


A 


r» 

la 


A 

A 


la 


A 
A 


rp 
1 


la 


a 
A 


rp 
X 


ri 
la 


T 


T 


T 


A A 1 

44 1 




G A A 


G 


T 


T 


A 


T 


T 
1 


la 


A 


A 


A 

A 


rp 
1 


A 

A 


la 


TV 

A 


la 


A 


A 
A 


ri 
la 


T 


A 


T 


442 




A T G 


T 


A 


T 


G 


T 


A 
A 


rp 
X 


A 


G 


A 
A 


A 
A 


A 

A 


r* 
la 


A 
A 


la 


rp 
1 


a 

A 


a 

A 


A 


T 


G 


a a *a 
443 




GAT 


G 


T 


T 


T 


G 


T 
1 


A 
A 


A 


A 


r» 

la 


A 
A 


rp 

1 


rp 
1 


rt 
va 


a 
A 


A 


a 

A 


T 
X 


T 


G 


A 


444 




A A T 


T 


T 


A 


G 


A 


ia 


A 
A 


G 


T 


A 
A 


rp 
1 


T 


T 


la 


T 
1 


la 


rp 

i 


rp 
1 


G 


T 


A 


A A C 

445 




A A T 


T 


T 


G 


T 


T 


X 


rx 


A 


A 


A 

ri 


n 


A 


A 


A 
A 


ia 


rp 

1 


a 

A 


a 

A 


G 


T 


G 


A A C 

44o 




AAA 


G 


A 


G 


T 


A 


la 


T 
X 


G 


T 


T 


A 
A 


T 


T 


r* 

va 


rp 
1 


T 
1 


a 

A 


la 


A 


T 


A 


447 




G T A 


T 


G 


T 


T 


G 


T 
X 


A 
A 


T 


A 


rp 
1 


ri 
la 


T 


T 


a 

va 


T 
1 


T 
1 


la 


A 

A 


T 


A 


T 


/t x a 
44 8 




G T A 


G 


A 


A 


T 


T 


T 
1 


la 


T 


T 


la 


A 

A 


G 


T 


A 

A 


rp 
1 


rp 
1 


rp 
X 


ri 
la 


T 


A 


A 


449 




A T G 


A 


A 


T 


T 


T 


A 
ri 




T 


T 


A 


ex 

la 


T 


G 


1 


a 

A 


a 

A 


*a 


a 

A 


A A 


G 


A c n 
4d0 




A T G 


A 


T 


A 


A 


G 


A 


A 


A 


T 


G 


T 


T 


G 


A 


T 


G 


A 


A 


G 


T 


A 


451 




T T G 


A 


T 


G 


A 


T 


G 


A 


A 


G 


A 


T 


A 


A 


T 


G 


T 


A 


G 


A - 


T 


A 


452 




A G A 


T 


G 


A 


T 


A 


T 


G 


A 


T 


A 


T 


A 


G 


A 


T 


T 


A 


G 


A 


T 


G 


453 




T T G 


A 


A 


A 


G 


T 


T 


A 


G 


A 


A 


A 


G 


A 


T 


A 


G 


A 


T 


G 


T 


T 


454 




G T T 


T 


A 


A 


T 


G 


T 


T 


A 


G 


T 


T 


A 


G 


A 


A 


A 


G 


T 


A 


A 


G 


455 




GAG 


A 


T 


T 


T 


A 


A 


G 


T 


T 


T 


G 


A A 


G 


T 


G 


A 


A 


A 


T 


A 


456 




T T T 


G 


T 


T 


A 


G 


T 


A 


G 


T 


T 


G 


T 


T 


A 


T 


A 


A 


G 


A 


G 


A 


457 




TAT 


G 


A 


G 


A 


A 


T 


A 


G 


T 


T 


T 


G 


T 


T 


A 


G 


T 


G 


A 


A 


T 


458 




T T G 


A 


A 


A 


G 


T 


T 


T 


A 


A 


A 


G 


A 


A 


G 


A 


G 


A 


T 


A 


A 


G 


459 




A A G 


T 


G 


A 


G 


T 


T 


G 


A A 


A 


T 


G 


A 


A 


A 


T 


A 


T 


G 


T 


T 


460 




G T T 


A 


G A 


A 


A 


T 


G 


AAA 


T 


G 


A 


G 


T 


A 


G 


T 


T 


A 


T 


461 
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Table X. 











Sequence 
























SEQ 


ID NO: 


No • in 
Ex 1 


T A A 


G T 


A 


T 


T 


G 


T 


A T 


T 


T 


G 


T 


G 


T 


G 


T 


G 


T 


A 


T 


462 


~ 


T G T 


A T 


T 


A 


G 


T 


AAA 


G 


A 


A 


G 


A 


G 


A 


G 


A 


A 


T 


A 


463 


— 


GAGAAGAGAAATAAGTTGAAATAA 


4 64 


— 


G T A 


A A 


G 


T 


A 


G 


A 


A A 


T 


A 


G 


A 


A 


T 


T 


G 


A 


G 


T 


T 


465 


— 


G T G 


T G 


T 


T 


A 


T 


T 


T G 


T 


T 


T 


G 


T 


A 


A 


A 


G 


T 


A 


T 


466 


69 


T T T 


G A 


T 


G 


T 


A 


T 


G A 


A 


T 


A 


T 


A 


G 


T 


A 


T 


G 


A 


G 


467 


- 


A A G 


A T 


T 


G 


T 


G 


T 


G A 


A 


T 


A 


G 


T 


T 


G 


A 


A 


A 


T 


T 


468 


*~ 


TAT 


A A 


A 


G 


T 


T 


T 


G A 


A 


G 


A 


T 


G 


A 


G 


T 


G 


A 


T 


A 


469 


~~ 


A G A 


T A 


A 


A 


G 


A 


G 


A T 


T 


T 


A 


A 


G 


A 


T 


G 


T 


A 


T 


G 


470 


71 


G A A 


G A 


A 


T 


T 


A 


A 


G T 


T 


G 


A 


G 


A 


A 


T 


T 


A 


A 


G 


A 


471 


- 


TAG 


A G 


A 


A 


A 


T 


T 


T G 


A 


T 


A 


A 


A 


G 


A 


A 


A 


G 


A 


G 


472 


- 


AAA 


G T 


T 


T 


A 


T 


G 


A A 


G 


T 


T 


A 


T 


T 


G 


A 


G 


T 


A 


G 


473 


• - 


AAA 


T A 


G 


T 


G 


T 


A 


A G 


T 


A 


A 


A 


G 


A 


G 


A 


T 


G 


A 


T 


474 


- 


TAT 


G A 


T 


G 


A 


T 


T 


T A 


G 


T 


T 


A 


T 


A 


A 


G 


A 


G 


T 


G 


475 


- 


TAG 


A T 


A 


A 


A 


T 


G 


T T 


A 


T 


G 


A 


T 


G 


A 


G 


T 


A 


A 


G 


476 


- 


A G A 


T T 


G 


A 


T 


T 


G 


T G 


A 


T 


G 


A 


T 


T 


T 


G 


T 


A 


T 


A 


477 


- 


T T A 


A G 


A 


A 


G 


A 


A 


T T 


G 


T 


A 


T 


A 


T 


G 


A 


G 


A 


G 


T 


478 


73 


G T A 


G A 


A 


T 


G 


T 


T 


T A 


G 


A 


G 


T 


T 


G 


A 


A 


T 


A 


T 


A 


479 


- 


GAG 


A A 


A 


T 


A 


G 


T 


A A 


G 


A 


A 


G 


T 


A 


A 


A 


T 


A 


G 


A 


480 


- 


ATT 


G A 


A 


G 


T 


T 


G 


T T 


A 


T 


G 


T 


G 


A 


A 


G 


A 


T 


T 


T 


481 


- 


T A A 


A T 


G 


T 


T 


G 


T 


G T 


A 


G 


A 


G 


T 


A 


A 


T 


T 


A 


G 


A 


482 


- 


AAA 


TAAGAGTTT 


G 


A 


G 


A 


A 


G 


T 


T 


G 


T 


T 


T 


483 


- 


A G T 


T G 


T 


A 


A 


T 


A 


A G 


A 


A 


G 


T 


G 


A 


T 


T 


T 


A 


A 


G 


484 


- 


G T T 


A G 


A 


A 


T 


G 


T 


A T 


A 


T 


A 


G 


A 


G 


T 


T 


A 


G 


A 


T 


485 


74 


T T G 


A T 


A 


T 


T 


G 


A 


A A 


G 


A 


G 


A 


A 


A 


G 


T 


T 


A 


T 


G 


486 


- 


T T A 


A A 


G 


A 


G 


A 


G 


A A 


A 


T 


G 


T 


T 


T 


G 


A 


T 


T 


A 


G 


487 


- 


T G T 


G A 


A 


T 


T 


T 


G 


A G 


T 


A 


T 


T 


A 


G 


T 


A 


A 


G 


A 


A 


488 


- 


T A A 


T T 


T 


G 


A 


A 


T 


G T 


G 


A 


A 


A 


G 


T 


T 


G 


T 


T 


A 


G 


489 


- 


A T G 


T G 


T 


T 


T 


G 


A 


A A 


G 


A 


T 


G 


A 


T 


G 


A 


T 


T 


T 


A 


490 


- 


A A G 


T T 


A 


T 


G 


T 


T 


G A 


T 


A 


T 


T 


G 


A 


G 


T 


G 


A 


A 


A 


491 


- 


TAG 


A T 


A 


A 


A 


G 


A 


A G 


A 


T 


A 


G 


A 


G 


A 


T 


T 


T 


A 


G 


492 




GAT 


G A 


A 


T 


G 


T 


A 


G A 


T 


A 


T 


A 


T 


G 


T 


A 


A 


T 


G 


A 


493 




G A A 


G A 


A 


T 


A 


G 


T 


T T 


A 


T 


G 


T 


A 


A 


A 


T 


G 


A 


T 


G 


494 


- 


G T A 


G T 


A 


T 


A 


T 


A 


G T 


T 


A 


A 


A 


G 


A 


T 


G 


A 


G 


T 


T 


495 


- 


G T T 


A T 


T 


T 


G 


T 


G 


T A 


T 


G 


A 


T 


T 


A 


T 


G 


A 


T 


T 


G 


496 


- 


A G A 


G A 


T 


T 


A 


G 


A 


A A 


T 


T 


G 


A 


G 


A 


G 


A 


A 


T 


T 


A 


497 


- 


G T A 


T G 


A 


T 


A 


G 


A 


G T 


T 


T 


A 


T 


A 


G 


T 


G 


A 


T 


A 


A 


498 


- 


G T T 


A G 


A 


A 


A 


G 


A 


A T 


G 


A 


A 


A 


T 


T 


G 


A 


A 


G 


T 


A 


499 


- 


A A G 


A A 


T 


G 


A 


G 


A 


A T 


A 


T 


A 


G 


A 


G 


A 


T 


G 


A 


A 


T 


500 


— 


AAA 


G A 


G 


A 


A' 


T 


A 


G T 


G 


T 


T 


T 


A 


A 


G 


A 


A 


G 


A 


T 


501 


- 


GAT 


G T 


G 


T 


T 


A 


T 


T G 


A 


T 


A 


G 


A 


A 


A 


T 


T 


A 


G 


A 


502 




TAG 


A G 


T 


T 


A 


T 


A 


G A 


G 


A 


T 


A 


T 


T 


G 


T 


A 


T 


G 


A 


503 


— 


GAG 


A G 


T 


T 


G 


A 


A 


T A 


A 


G 


T 


T 


A 


A 


A 


G 


A 


T 


A 


T 


504 


- 


A G A 


T A 


T 


G 


A 


A 


A 


T A 


G 


A 


T 


T 


G 


T 


T 


A 


G 


A 


G 


A 


505 




GAG 


T G 


A 


A 


T 


A 


G 


A A 


A 


G 


A 


T 


A 


T 


G 


T 


T 


A 


A 


T 


506 


- 


AAA 


G A 


G 


A 


T 


A 


T 


T G 


A 


A 


G 


A 


G 


A 


A 


T 


A 


A A 


G 


507 




G T T 


A T 


A 


G 


A 


A 


T 


A A 


G 


T 


T 


G 


T 


A 


A 


A 


G 


T 


G 


T 


508 


- 


T G A 


T A 


G 


T 


A 


T 


G 


A T 


A 


A 


T 


G 


T 


G 


T 


T 


T 


A 


T 


G 


509 




T T T 


G T 


T 


G 


T 


T 


A 


A G 


T 


A 


T 


G 


T 


G 


A 


T 


T 


T 


A 


G 


510 


77 


T A A 


A G 


T 


G 


T 


T 


G 


T G 


T 


T 


A 


A 


A 


G 


A 


T 


T 


A 


A 


G 


511 




T G T 


G T 


T 


T 


G 


A 


T 


T G 


A 


T 


T 


A 


A 


T 


G 


T 


T 


A 


T 


G 


512 




ATT 


A A 


T 


G 


A 


A 


T 


G A 


G 


T 


G 


T 


T 


G 


T 


A 


A 


T 


G 


T 


513 




TAG 


A T 


G 


T 


T 


T 


G 


T G 


A 


G 


T 


T 


T 


G 


A 


T 


A 


T 


T 


A 


514 




G A A 


T G 


A 


A 


T 


A 


G 


T A 


A 


T 


A 


G 


A 


T 


G 


A 


T 


T 


T 


G 


515 




A A T 


A G 


T 


G 


T 


G 


T 


T G 


T 


T 


A 


T 


A 


T 


G 


A 


T 


T 


A 


G 


516 




TAG 


A T 


T 


A 


G 


A 


A 


G A 


T 


G 


T 


T 


G 


T 


G 


T 


A 


T 


T 


A 


517 




A A T 


G T 


G 


T 


G 


T 


G 


T T 


A 


A A 


T 


G 


A 


A 


T 


T 


T 


G 


T 


518 
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Table I 

Sequence SEQ ID NO: No. in 

Ex 1 



CI 71 ft. 
o A 


T 


T 


A 

A 


A 
A 


r* 
ij 


rp 
X 


A 

A 


rp 
i 


A 


T 


r-% 

K3 


A 

A 


r~» 
VJ 


rp 
X 




rp 

X 


A 


G 


A 


A 


A 


519 




T t a 

1 1 rl 


T 


T 


ci 

\3 


1 


r* 

Vjr 


rp 


/-» 
\j 


rp 
1 


A 


A 


c* 

\3 


rp 
X 


A 
A 


r* 

VJ 


rp 

X 


r-i 

(jr 


T 


A 


A 


A 


T 


C ") f\ 




G T A 

w 1 A 


G 


T 


A 

A 


A 
A 


A 
A 


r* 


A 

A 


r> 
\j 


A A 


rp 
1 


rp 
X 


ft 
\J 


rp 
X 


rp 
X 


rp 

X 


A 


G 


T 


A 


T 


D £ X 


80 


a a fi 

n n u 


T 


T 


rp 
1 


p 


rp 


A 

A 


A 

A 


r* 
\J 


A 


A 


r» 


rp 
1 


A 

A 


r* 

\j 


rp 
X 


rp 
X 


G 


A A 


T 


A 


c n o 




n « i 


T 


A 


rp 
X 


TV 

A 


r% 


rp 
1 


A 


rp 
1 


A 


G 


rp 
X 


A 


/-» 
L» 


rp 

X 


TV 
A 


rp 

T 


A 


G 


A 


G 


A 


rr i *5 
523 




V3 n. A 


A 


G 


A 

A 


A 


A 

A 


rp 


/-I 

VJ 


rp 


G 


T 


A 

A 


rp 
X 


A 

A 


r*\ 
\D 


rp 


rp 
X 


T 


A 


A 


T 


G 


524 




T t n 


T 


G 


A 

A 


r* 

O 


rp 


A 
A 


A 

A 


rp 

X 


G 


A 


A 

A 


rp 

T 


r* 
\j 


A 

A 


T 


G 


T 


A 


T 


T 


A 


525 




o 1 A 


G 


A 


rr 

lj 


rp 


rp 
X 


c* 

lj 


rp 
X 


A 
A 


A 


A 


rp 
X 


A 




A 


Cj 


A 


A T A A A 


526 




a t t 

All 


A A 


rp 
J. 


r* 
Vj 


rp 
X 


A 

A 


r* 

Vj 


A 

A 


T 


T 


/-I 

\J 


rp 
X 


A 

A 


TV 

A 




A 


G 


A 


T 


A 


G 


C O H 

52 7 




1 1 A 


G 


T 


/~t 
*a 


rp 


/-» 
Vj 


T 


T 


T 


G 


T 


A 


G 


A 


T 


A 


G 


A 


A 


T 


T 


A 


52 8 




a r* a 

A Vj A 


G 


A 


/-i 
(j 


rp 
X 


T 


rp 

r 


G 


T 


G 


T 


A 


T 


A 


T 


G 


T 


A 


T 


A 


A 


A 


529 


81 


1 1 A 


A 


G 


rp 


rp 
X 


T 


A 


G 


T 


G 


A 


G 


A 


T 


T 


T 


G 


T 


T 


A 


A 


G 


53 0 




TV rp f>i 
A X O 


A 


A 


G 


T 


T 


T 


A 


T 


T 


G 


A 


A 


T 


A 


G 


T 


A 


G 


T 


G 


A 


531 




ATA 
A 1 A 


T 


T 


rp 
1 


r* 
\j 


rp 


Lj 


T 


T 


G 


T 


A 


T 


G 


T 


T 


T 


G 


T 


G 


A 


A 


532 




AAA- 
AAA 


G 


T 


/~i 
<J 


T 


T 


T 


A 


T 


A 


G 


A 


A 


G 


A 


T 


T 


T 


G 


A 


T 


G 


533 




A A C* 

A- A Vj 


A 


G 


A 


T 


A 


T 


G 


A 


T 


T 


T 


G 


T 


T 


A 


G 


T 


T 


G 


T 


A 


534 




A A G 


A 


A 


G 


A 


-A 


A 


T 


G 


A 


G 


T 


G 


A 


T 


A 


A 


T 


G 


T 


A 


A 


535 




TAG 


T 


G 


T 


T 


T 


G 


A 


T 


A 


T 


G 


T 


T 


A 


A 


G 


A 


A 


G 


T 


T 


536 




G T A 


G 


A 


A 


A 


G 


T 


G 


A 


T 


A 


G 


A 


T 


T 


A 


G 


T 


A 


A 


T 


A 


537 


— 


GAT 


A 


A 


A 


T 


G 


T 


T 


A 


A 


G 


T 


T 


A 


G 


T 


A 


T 


G 


A 


T 


G 


538 




A G A 


T 


T 


A 


G 


A 


A 


G 


A 
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Table I 

Sequence SEQ ID NO: No. in 

Ex 1 
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Table I 

Sequence SEQ ID NO:. No. in 

Ex 1 



GTGAAATG 


A 


T T 


T 


AGAGTAATA A *G T 


633 


- 


AAATAAGA 


A 


T A 


G 


AGAGAGAAAGTT 


634 


9 


GTTGTAAA 


G 


T A 


A 


TAGAGAAATTAG 


635 


- 


AGTGA.TTT 


A 


G A 


T 


TATGTGATGATT 


636 


- 


AGAGTATA 


G 


T T 


T 


AGATTTATGTAG 


637 


- 


A T G A T T A -G 


A 


T A 


G 


TGAAATTGTTAG 


633 


- 


ATGAAATG 


T 


A T 


TAGTTTAGAGTTG 


639 


- 


ATATTGAG 


T 


G A 


G 


AGTTATTGTTAA 


64 0 


- 


AGATGTGT 


A 


T T 


G 


AATTAAGAAGTT 


641 


- 


TAATGTGT 


T 


G A 


T 


AGAATAGAGATA 


642 


- 


AAATTAGT 


T 


G A 


A 


AGTATGAGAAAG 


643 


11 


TTTAGAGT 


T 


G A 


A 


GAAATGTTAATG 


644 


- 


GATTGTTGATTA. 


TTGATGAATTTG 


645 


- 


TGTTGTTG 


T 


T G 


A 


ATTGAAGAATTA 


646 


- 


ATTAAGTA 


A 


G A 


A 


TTGAGAGTTTGA 


647 


12 


GTATGTTG 


T 


A A 


T 


GTATTAAGAAAG 


648 


15 


TAGTTGTG 


A 


T T 


T 


ATGTAATGATTG 


649 


- 


TGATAATG 


A 


A A 


G 


TTTATAGAGAGA 


650 


- 


GTAAGATT 


G 


T T 


T 


GTATGATAAGAT 


651 


- 


TTGAATTA 


A 


G A 


G 


TAAGATGTTTAG 


652 


- 


AAGTGTTT 


G 


T T 


T 


AGAGTAAAGATA 


653 


- 


AGAGAGAT 


A 


A A 


G 


TATAG AAG TTAA 


654 


- 


ATTATGAA 


T 


A G 


T 


TAGAAAGAGAGT 


655 


- 


T T G T T G AT 


A 


T T 


A 


GAGAATGTGTTT 


656 


- 


TTTATTGA 


G 


A G 


T 


TTGTTATTTGTG 


657 


- 


AGTGTTAA 


G 


A A 


G 


TTGATTATTGAT 


658 


- 


GAGAAATG 


A 


T T 


G 


AATGTTGATAAT 


659 


- 


GATAAGTA 


T 


T A 


G 


TATGAGTGTAAT 


660 


- 


TTTGATTT 


A 


A G 


A 


GTGTTGAATGTA 


661 


16 


AAGTTAGT 


A 


A A 


T 


AGAGTAGAAAGA 


662 




GTAAAGTA 


T 


G A 


A 


TATGTGAAATGT 


663 




TAATAAGTGTGTTGTGAATGTAAT 


664 


- 


AAAGATTT 


A 


G A 


G 


TAGAAAGAGAAT 


665 


- 


TTAGTTTG 


A 


G T 


T 


GAAATAGTAAAG 


666 


- 


TAATAGTA 


T 


G A 


G 


TAAGATTGAAAG 


667 


- 


GAAGATTA 


G 


A T 


T 


GATGTTAGTTAA 


668 


- 


TAAAGAGA 


G 


A A 


G 


TTAGTAATAGAA 


669 


- 


TAAGTATG 


A 


G A 


A 


ATGATGTGTTAT 


670 


- 


GAGTTTGT 


T 


T G 


T 


TAGTTATTGATA 


671 


- 


AAGTAAAG 


A 


A A 


T 


GTTAAGAGTAGT 


672 




ATGAGAAT 


T 


G T 


T 


GTTGAAATGTAA 


673 




TTAGATTA 


G 


A G 


T 


AGTAGAAGAATA 


674 


- 


TAGTGATG 


A 


A G 


A 


AGTTAGAAATTA 


675 


- 


TAATGTAG 


T 


A A 


T 


GTGATGATAAGT 


676 


- 


TTGAGAAA 


G 


A A 


T 


AAGTAGTGTAAA 


677 


- 


TAATGAGT 


G 


A G 


A 


TTATAGATTGTT 


678 




GTATAAGA 


A 


A T 


G 


TGTGTTTGATTA 


' 679 




GTGAATGT 


G 


T T 


A 


ATGAAGATATAT 


680 




GAAAGTTA 


T 


T A 


G 


TAGTTAAAGATG 


681 




TAGAATTG 


T 


G T 


T 


TGATAAGTGATA 


682 




TGATTTAG 


A 


T T 


G 


AGAGTTAAATGA 


683 




ATTATTGA 


G 


T T 


T 


GAATGTTGATAG 


684 




ATAGTAGT 


T 


A T 


G 


TTTGATTTAGTG 


685 




ATAGAAGA 


A 


G A 


A 


TAAAGTTAGAGA 


666 




GATGTTGA 


A 


A G 


T 


AATGAATTTGTA 


687 




GAGATTGA 


T 


A G 


T 


AGAAATGATAAA 


688 




TGAGAGAA 


T 


A A 


A 


G T A T G A AT T T G A 


689 
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Table I 

















Sequence 
























SEQ 


ID NO: 


No. in 
Ex 1 


T 


A 


T 


A A 


A 


G 


A 


T 


G A 


T 


G 


T 


G 


A 


A 


T T 


AG 


T 


A 


G 


690 


- 


T 


T 


A 


T 


G 


T 


A 


A 


G 


A A 


T 


G 


T 


T 


T 


G 


A 


G 


A 


G 


A 


A 


A 


691 


- 


A 


G 


T 


A 


A 


A 


T 


G 


A 


T 


G 


A 


A 


T 


G 


A 


T 


A 


T 


G 


A 


T 


G 


A 


692 


- 


G 


A 


A 


A 


T 


T 


T 


G 


T 


G 


T 


T 


A 


A 


A 


G 


T 


T 


G 


A 


A 


T 


G 


A 


693 


- 


G 


A 


T 


G 


A 


A 


T 


G 


A 


T 


T 


G 


T 


G 


T 


T 


T 


A 


A 


G 


T 


A 


T 


A 


694 


- 


G 


A 


A 


A 


T 


A 


A 


G 


T 


G 


A 


G 


A 


G 


T 


T 


A 


A 


T 


G 


A 


A 


A 


T 


695 


- 


T 


G 


T 


T 


G 


A 


A 


A 


T 


A 


G 


T 


T 


A 


T 


T 


A 


G 


T 


T 


T 


G 


T 


G 


696 


- 


T 


T 


T 


G 


A 


G 


A 


G 


T 


A 


T 


A 


T 


T 


G 


A 


T 


A 


T 


G 


A 


G 


A 


A 


697 


- 


A 


T 


T 


G 


T 


G 


T 


G 


T 


A 


A 


A 


G 


T 


A 


A 


G 


A 


T 


T 


T 


A 


A 


G 


698 


- 


T 


A 


T 


A 


G 


T 


T 


T 


G 


A 


A 


G 


T 


G 


T 


G 


A 


T 


G 


T 


A 


T 


T 


T 


699 


- 


G 


T 


G 


A 


A 


G 


T 


T 


A 


T 


A 


G 


T 


G 


T 


A 


T 


A 


A 


A 


G 


A 


A 


T 


700 


- 


G 


T 


A 


T 


G 


T 


T 


G 


A 


A 


T 


A 


G 


T 


A 


A 


A 


T 


A 


G 


A 


T 


T 


G 


701 


- 


T 


T 


A 


G 


A 


A 


A 


G 


T 


G 


T 


G 


A 


T 


T 


T 


G 


T 


G 


T 


A 


T 


T 


T 


702 


- 


T 


T 


T 


A 


G 


T 


A 


A 


T 


A 


T 


G 


T 


A 


A 


G 


A 


G 


A 


T 


G 


T 


G 


A 


703 


- 


A 


G 


T 


A 


T 


G 


T 


A 


T 


A 


G 


A 


T 


G 


A 


T 


G 


T 


T 


T 


G 


T 


T 


T 


704 


- 


A 


T 


T 


T 


A 


A 


G 


T 


A 


A 


A 


G 


T 


G 


T 


A 


G 


A 


G 


A 


T 


A 


A 


G 


705 


20 


A 


T 


T 


T 


G 


T 


G 


T 


T 


G 


A 


A 


T 


T 


G 


T 


A 


A 


A 


G 


T 


G 


A 


A 


706 


- 


A 


T 


G 


T 


T 


A 


T 


T 


A 


G 


A 


T 


T 


G 


T 


G 


A 


T 


G 


A 


A 


T 


G 


A 


707 


- 


T 


A 


G 


T 


A 


G 


T 


A 


G 


A 


A 


T 


A 


T 


G 


A 


A 


A 


T 


T 


A 


G 


A 


G 


708 


- 


T 


T 


T 


A 


A 


T 


G 


A 


G 


A 


A 


G 


A 


G 


T 


T 


A 


G 


A 


G 


T 


A 


T 


A 


709 


- 


A 


A 


A 


G 


T 


T 


T 


A 


G 


T 


A 


G 


A 


G 


T 


G 


T 


A 


T 


G 


T 


A 


A 


A 


710 


- 


A 


T 


A 


T 


A 


T 


G 


A 


T 


A 


G 


T 


A 


G 


A 


G 


T 


A 


G 


A 


T 


T 


A 


G 


711 


- 


T 


G 


A 


G 


A 


A 


G 


T 


T 


A 


A 


T 


T 


G 


T 


A 


T 


A 


G 


A 


T 


T 


G 


A 


712 




T 


A 


T 


A 


G 


A 


G 


A 


T 


G 


T 


T 


A 


T 


A 


T 


G 


A 


A 


G 


T 


T 


G 


T 


713 


- 


• A 


A 


A 


T 


T 


T 


G 


T 


T 


A A 


G 


T 


T 


G 


T 


T 


G 


T 


T 


G 


T 


T 


G 


714 


- 


T 


T 


G 


T 


T 


G 


A A 


G 


A 


T 


G 


A A 


A 


G 


T 


A 


G 


A A 


T 


T 


A 


715 


- 


A 


A 


G 


A G 


A 


T 


A 


A 


G 


T 


A 


G 


T 


G 


T 


T 


T 


A 


T 


G 


T 


T 


T 


716 


- 


A 


A 


T 


A 


A 


G 


A 


A 


G 


A 


A 


G 


T 


G 


A 


A 


A 


G 


A 


T 


T 


G 


A 


T 


717 




T 


A 


A 


G 


T 


T 


A 


A 


A 


G 


T 


T 


G 


A 


T 


G 


A 


T 


T 


G 


A 


T 


A 


G 


718 


- 


A 


T 


A 


T 


A 


A 


G 


A 


T 


A 


A 


G 


A 


G 


T 


G 


T 


A 


A 


G 


T 


G 


A 


T 


719 


- 


G 


T 


T 


A 


A 


A 


T 


G 


T 


T 


G 


T 


T 


G 


T 


T 


T 


A 


A 


G 


T 


G 


A 


T 


720 


- 


G 


A 


G 


T 


T 


A 


A 


G 


T 


T 


A 


T 


T 


A 


G 


T 


T 


A 


A 


G 


A 


A 


G 


T 


721 


- 


T 


A 


T 


T 


A 


G 


A 


G 


T 


T 


T 


G 


A 


G 


A 


A 


T 


A 


A 


G 


T 


A 


G 


T 


722 


21 


T 


A 


A 


T 


G 


T 


T 


G 


T 


T 


A 


T 


G 


T 


G 


T 


T 


A 


G 


A 


T 


G 


T 


T 


723 


- 


G 


A 


A 


A 


G 


T 


T 


G 


A 


T 


A 


G 


A 


A 


T 


G 


T 


A 


A 


T 


G 


T 


T 


T 


724 


- 


T 


G 


A 


T 


A 


G 


A 


T 


G 


A 


A 


T 


T 


G 


A 


T 


T 


G 


A 


T 


T 


A 


G 


T 


725 


- 


A 


T 


G 


A 


T 


A 


G 


A 


G 


T 


A 


A 


A 


G 


A 


A 


T 


A 


A 


G 


T 


T 


G 


T 


726 


- 


A 


G 


T 


A 


A 


G 


T 


G 


T 


T 


A 


G 


A 


T 


A 


G 


T 


A 


T 


T 


G 


A 


A 


T 


727 


27 


A 


T 


G 


T 


A 


G 


A 


T 


T 


A 


A 


A 


G 


T 


A 


G 


T 


G 


T 


A 


T 


G 


T 


T 


728 




T 


T 


A 


T 


T 


G 


A 


T 


A 


A 


T 


G 


A 


G 


A 


G 


A 


G 


T 


T 


A 


A 


A 


G 


729 


- 


A 


T 


T 


T 


G 


T 


T 


A 


T 


G 


A 


T 


A 


A 


A 


T 


G 


T 


G 


T 


A 


G 


T 


G 


730 


29 


T 


T 


G 


A 


A 


G 


A 


A 


A 


T 


A 


A 


G 


A 


G 


T 


A 


A 


T 


A 


A 


G 


A 


G 


731 


- 


T 


G 


T 


G 


T 


A 


A 


T 


A 


A 


G 


T 


A 


G 


T 


A 


A 


G 


A 


T 


T 


A 


G 


A 


732 


- 


A 


T 


G 


A A 


A 


G 


T 


T 


A 


G 


A 


G 


T 


T 


T 


A 


T 


G 


A 


T 


A 


A 


G 


733 


37 


A 


T 


T 


A 


G 


T 


T 


A 


A 


G 


A 


G 


A 


G 


T 


T 


T 


G 


T 


A 


G 


A 


T 


T 


734 


- 


T 


G 


T 


A 


G 


T 


A 


T 


T 


G 


T 


A 


T 


G 


A 


T 


T 


A 


A 


A 


G 


T 


G 


T 


735 


- 


A 


G 


T 


T 


G 


A 


T 


A 


A 


A 


G 


A 


A 


G 


A 


A 


G 


A 


G 


T 


A 


T 


A 


T 


736 


- 


G 


T 


A 


A 


T 


G 


A 


G 


A 


T 


A 


A 


A 


G 


A 


G 


A 


G 


A 


T 


A 


A 


T 


T 


737 




T 


G 


T 


G 


T 


T 


G 


A 


A 


G 


A 


T 


A 


A 


A 


G 


T 


T 


T 


A 


T 


G 


A 


T 


738 




A 


A 


G 


A 


A 


G 


A 


G 


T 


A 


G 


T 


T 


A 


G 


A 


A 


T 


T 


G 


A 


T 


T 


A 


739 




G 


A 


A 


T 


G 


A 


A 


G 


A 


T 


G 


A 


A 


G 


T 


T 


T 


G 


T 


T 


A 


A 


T 


A 


740 




A A 


A 


T 


T 


G 


T 


T 


G 


A 


G 


A 


T 


A 


A 


G 


A 


T 


A 


G 


T 


G 


A 


T 


741 




T 


G 


A 


T 


T 


G 


T 


T 


T 


A A 


T 


G 


A 


T 


G 


T 


G 


T 


G 


A 


T 


T 


A 


742 




A 


T 


G 


A 


A 


G 


T 


A 


T 


T 


G 


T 


T 


G 


A 


G 


T 


G 


A 


T 


T 


T 


A 


A 


743 




G 


T 


G 


T 


A 


A 


A 


T 


G 


T 


T 


T 


G 


A 


G 


A 


T 


G 


T 


A 


T 


A 


T 


T 


744 


46 


A 


A 


T 


T 


G 


A 


T 


G 


A 


G 


T 


T 


T 


A 


A 


A 


G 


A 


G 


T 


T 


G 


A 


T 


745 




T 


T 


T 


G 


T 


G 


T 


A 


A 


T 


A 


T 


G 


A 


T 


T 


G 


A 


G 


A 


G 


T 


T 


T 


746 
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Table I 

Sequence SEQ ID NO: No. in 

Ex 1 



G 


TAG 


T A 


G 


A 


T 


G A 


T 


T 


A A 


G 


A 


A 


G 


A T A A A 


747 


- 


T 


T T 


A 


A 


T 


G 


T 


G 


A A 


A 


T 


T 


T 


G 


T 


T 


G 


T 


G 


A 


G 


T 


748 


- 


G 


T A 


A 


A 


G 


A 


A 


T 


T A 


G 


A 


T 


A 


A 


A 


G 


A 


G 


T 


G 


A 


T 


749 


- 


A 


A T 


A 


G 


T 


T 


A 


A 


G T 


T 


T 


A 


A 


G 


A 


G 


T 


T 


G 


T 


G 


T 


750 


- 


G 


T G 


T 


G 


A 


T 


G 


T 


T T 


A 


T 


A 


G 


A 


T 


T 


T 


G 


T 


T 


A 


T 


751 


- 


G 


T A 


T 


A 


G 


T 


G 


T 


G A 


T 


T 


A 


G 


A 


T 


T 


T 


G 


T 


A 


A 


A 


752 


49 


G 


T T 


G 


T 


A 


AG 


A 


A A 


G 


A 


T 


A 


T 


G 


T 


A 


A G 


A 


A 


A 


753 


- 


A 


T A 


T 


T 


A 


G 


A 


T 


T G 


T 


A 


A 


A 


G 


A 


G 


A 


G 


T 


G 


A 


A 


754 


- 


G 


A G 


T 


G 


A 


T 


A 


T 


T G 


A 


A 


A 


T 


T 


A 


G 


A 


T 


T 


G 


T 


A 


755 


- 


T 


A A 


G 


A 


A 


G 


T 


T 


A A 


A 


G 


A 


A 


G 


A 


G 


A 


G 


T 


T 


T 


A 


756 


- 


'g 


A T 


G 


T 


T 


A 


G 


A 


T A 


A 


A 


G 


T 


T 


T 


A 


A 


G 


T 


A 


G 


T 


757 


- 


G 


T G 


A 


T 


T 


G 


T 


A 


T G 


A 


G 


A 


A 


A 


T 


G 


T 


T 


A 


A 


A 


T 


758 


- 


T 


G A 


T 


T 


A 


T 


T 


G 


T A 


A 


G 


A 


A 


A 


G 


A 


T 


T 


G 


A 


G 


A 


759 


- 


A 


A G 


A 


A 


T 


T 


G 


T 


G T 


A 


A 


G 


T 


T 


T 


A 


T 


G 


A 


G 


T 


A 


760 


- 


T 


T G 


T 


A 


T 


T 


T 


A 


G A 


A 


G 


A 


T 


T 


T 


G 


T 


A 


G 


A 


T 


G 


761 


- 


T 


A T 


A 


T 


G 


T 


T 


T 


G T 


G 


T 


A 


A 


G 


A 


A 


G 


A 


A 


A 


T 


G 


762 


- 


G 


A T 


A 


A 


T 


G 


T 


G 


T G 


A 


A 


T 


T 


T 


G 


T 


G 


A 


A 


T 


A 


A 


763 


- 


T 


T A 


G 


A A 


A 


T 


G 


T G 


A 


G 


A 


T 


T 


T 


A 


A 


G 


A 


G 


T 


T 


764 


- 


A 


G T 


G 


T 


A 


G 


A 


A 


T T 


T 


G 


T 


A 


T 


T 


T 


A 


G 


T 


T 


G 


T 


765 


- 


T 


A G 


T 


T 


A A 


G 


A 


T A 


G 


A 


G 


T 


A 


A 


A 


T 


G 


A 


T 


A 


G 


766 


- 


G 


A A 


G 


T 


G 


A 


T 


A 


T T 


G 


T 


A 


A 


A 


T 


T 


G 


A 


T 


A 


A 


G 


767 


- 


G 


T A 


A 


T 


T 


G 


T 


G 


T T 


A 


G 


A 


T 


T 


T 


A 


A 


G 


A 


A 


G 


T 


768 


- 


T 


G A 


T 


A 


T 


T 


T 


G 


T G 


A 


A 


T 


T 


G 


A 


T 


A 


G 


T 


A 


T 


G 


769 


- 


A 


A G 


T 


A 


A 


A 


G 


A 


G A 


T 


A 


T 


A 


G 


T 


T 


A 


A 


G 


T 


T 


G 


770 


- 


A 


T T 


A 


G 


T 


T 


A 


A 


G T 


T 


A 


T 


T 


T 


G 


T 


G 


A 


G 


T 


G 


A 


771 


- 


A 


G A 


T 


G 


A 


A 


G 


T 


A G 


T 


T 


T 


A 


T 


G 


A A 


T 


T 


A 


G 


A 


772 


- 


T 


G A 


G 


T 


T 


A 


G 


T 


T A 


A 


G 


T 


G 


A 


T 


A 


G 


T 


T 


A 


A 


A 


773 


- 


T 


T A 


T 


T 


G 


T 


A 


G 


A T 


T 


T 


A 


G 


A 


G 


A 


A 


G 


A 


T 


G 


A 


774 


- 


T 


A T 


T 


T 


G 


T 


G 


T 


T T 


G 


T 


T 


G 


A 


T 


T 


A 


G 


A 


T 


A 


G 


775 


- 


G 


T A 


T 


A 


A 


T 


G 


T 


G T 


G 


T 


G 


A 


A 


A 


G 


T 


T 


A 


T 


A 


A 


776 


- 


T 


A T 


A 


T 


G 


T 


T 


G 


A G 


T 


A 


T 


A 


A 


A 


G 


A 


G 


A 


G 


A 


A 


777 


- 


T 


T A 


G 


T 


T 


A 


G 


T 


T T 


A 


A 


A 


G 


A 


T 


T 


G 


T 


G 


A 


G 


T 


778 


- 


T 


T T 


A G A 


A 


T 


A A G 


T 


G 


A 


T 


G 


T 


G 


A 


T 


G 


A 


A 


A 


779 


- 


A 


G A 


G 


T 


A 


A 


T 


G 


T G 


T 


A 


A 


A 


T 


A 


G 


T 


T 


A 


G 


A 


T 


780 


- 


T 


G T 


G 


A 


T 


A 


A 


A 


G A 


G 


A 


A 


A 


T 


T 


A 


G 


T 


T 


G 


T 


T 


781 


- 


G 


A A 


T 


T 


T 


A 


G 


T 


G A 


A 


T 


G 
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G 
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A 


G 


T A 


G 


T 


G 


903 


- 


G 


T T 


T 


A 


G 


T 


A T 


T 


G 


A 


T 


A 


T 


T 


G 


T 


G 


T G 


T 


A 


A 


904 


- 


G 


T G 


T 


T 


G 


T 


G A 


A 


T 


A A 


G 


A 


T 


T 


G 


A 


A A 


T 


A 


T 


905 


- 


A 


A A 


G 


A 


A 


A 


G T 


A 


T 


A 


A 


A 


G 


T 


G 


A 


G 


A T 


A 


G 


A 


906 


- 


T 


A T 


T 


T 


G 


T 


A A 


G 


A 


A 


G 


T 


G 


T 


A 


G 


A 


T A 


T 


T 


G 


907 


- 


T 


A G 


A 


A 


G 


A 


T G 


A 


A 


A 


T 


T 


G 


T 


G 


A 


T 


T T 


G 


T 


T 


908 




A 


T A 


A 


T 


A 


G 


T A 


A 


G 


T 


G 


A 


A 


T 


G 


A 


T 


G A 


G 


A 


T 


909 




A 


A T 


G 


T 


G 


A 


A T 


A 


A 


G 


A 


T 


A 


A 


A 


G 


T 


G T 


G 


T 


A 


910 




A 


T T 


G 


A 


A 


G 


A T 


A 


A 


A 


G 


A 


T 


G 


T 


T 


G 


T T 


T 


A 


G 


911 




T 


G A 


A A 


T 


A 


G A 


A 


G 


T 


G 


A 


G 


A 


T 


T 


A 


T A 


G 


T 


A 


912 


76 


A 


G T 


T 


A 


T 


T 


G T 


G 


A 


A 


A 


G 


A 


G 


T 


T 


T 


A T 


G 


A 


T 


913 




A 


A A 


T 


A 


G 


T 


A G 


T 


G 


A 


T 


A 


G 


A 


G 


A 


A 


G A 


T 


T 


T 


914 




A 


G T 


G 


T 


A 


T 


G A 


A 


G 


T 


G 


T 


A 


A 


T 


A 


A 


G A 


T 


T 


A 


915 




T 


G A 


T 


T 


A 


A 


G A 


T 


T 


G 


T 


G 


T 


A 


G 


T 


G 


T T 


A 


T 


A 


916 




A 


G T 


T 


T 


A 


T 


G -A 


T 


A 


T 


T 


T 


G 


T 


A 


G 


A 


T G 


A 


G 


T 


917 
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Table I 













Sequence 




















SEQ 


ID NO: 


No, in 
Ex 1 


T A T G T 


, G 


T 


A 


T 


G 


A 


A 


G A 


T 


T 


A 


T 


A 


G 


T T 


A 


G 


* 918 


78 


G A A 


A 


T 


T 


G 


T 


T 


G 


T 


A 


T A 


G 


A 


G 


T 


G 


A 


T A 


T 


A 


919 


- 


TAG 


A 


A 


A 


T 


A 


G 


T 


T 


T 


A A 


G 


T 


A 


T 


A 


G 


T G 


T 


G 


920 


- 


TG A 


T 


T 


T 


A 


G 


A 


T 


G 


T 


T T 


A 


T 


T 


G 


T 


G 


A G 


A 


A 


921 


- 


A A G 


T 


T 


G 


A 


T 


A 


T 


T 


T 


G T 


T 


G 


T 


T 


A 


G 


A T 


G 


A 


922 


- 


T G A 


T 


G 


T 


G 


A 


T 


A 


A 


T 


GAG 


A 


A 


T 


A A 


A G 


A 


A 


923 


79 


AAA 


G 


T 


T 


T 


A 


G 


T 


T 


T 


G T 


A 


T 


T 


A 


G 


T 


A G 


A 


G 


924 


- 


A G T 


T 


T 


G 


A 


T 


G 


T 


G 


A 


T A 


G 


T 


A 


A 


A 


T 


A G 


A 


A 


925 


- 


A A G 


T 


G 


T 


T 


A 


T 


T 


G 


A 


A T 


G 


T 


G 


A 


T 


G 


T-T 


A 


T 


926 


- 


AAA 


T 


T 


G 


A 


A 


G 


T 


G 


T 


G A 


T 


A 


A 


T 


G 


T 


T T 


G 


T 


927 


- 


G T T 


T 


A 


G 


T 


G 


A 


T 


T 


A A A G 


A 


T 


A 


G 


A 


T T 


A 


G " 


928 


82 


ATA 


A 


G 


T 


G 


T 


A 


T 


A 


A 


G A 


G 


A 


A 


G 


T 


G 


T T 


A 


A 


929 


- 


A T G 


A 


A 


T 


T 


T 


G 


T 


T 


T 


G T 


G 


A 


T 


G 


A 


A 


G T 


T 


A 


930 


- 


AAA 


G 


A 


A 


T 


T 


G 


A 


G 


A 


A A 


T 


G 


A 


A 


A 


G 


T T 


A 


G 


931 


- 


A G T 


G 


T 


A 


A 


G 


A 


G 


T 


A 


T A 


A 


A 


G 


T 


A 


T 


T T 


G 


A 


932 


- 


G A A 


T 


T 


A 


A 


G 


A 


T 


T 


G 


T T 


A 


T 


A 


T 


G 


T 


G A 


G 


T 


933 


- 


TAT 


G 


A 


A 


A 


G 


T 


G 


T 


T 


G T 


T 


T 


A 


A 


G 


T 


A A 


G 


A 


934 


- 


T A A 


A 


G 


T 


A 


A 


A 


T 


G 


T 


T A 


T 


G 


T 


G 


A 


G 


A G 


A 


A 


935 


- 


AAA 


G 


A 


T 


A 


T 


T 


G 


A 


T 


T G 


A 


G 


A 


T 


A 


G 


A G 


T 


T 


936 


- 


A A G 


T 


G 


A 


T 


A 


T 


G A 


A 


T A 


T 


G 


T 


G 


A 


G 


A A 


A 


T 


937 


- 


AAA 


T 


A 


G 


A 


G 


T 


T 


T 


G 


T T 


A 


A 


T 


G 


T 


A 


A G 


T 


G 


938 


- 


GAT 


T 


T 


A 


G 


A 


T 


G 


A 


G 


T T 


A 


A 


G 


A 


A 


T 


T T 


A 


G 


939 


- 


T T G 


T 


A 


A 


A 


T 


G 


A 


G 


T 


G T 


G 


A 


A 


T 


A 


T 


T G 


T 


A 


940 




A G T 


A 


G 


T 


G 


T 


A 


T 


T 


T 


G A 


G 


A 


T 


A 


A 


T 


A G 


A 


A 


941 


- 


T G A 


G 


T 


T 


A 


A 


A 


G 


A 


G 


T T 


G 


T 


T 


G 


A 


T 


A T 


T 


T 


942 


- 


AAA 


G 


A 


G 


T 


G 


T 


A 


T 


T 


A G 


A 


A 


A 


T 


A 


G 


T T 


T 


G 


943 


- 


G T T 


T 


A 


G 


T 


T 


A 


T 


T 


T 


G A 


T 


G 


A 


G 


A 


T 


A A 


T 


G 


944 


- 


A A G 


T 


G 


T 


A 


A 


A 


T 


G 


A 


A T 


A 


A 


A 


G 


A 


G 


T T 


G 


T 


945 


- 


A A T 


A 


A 


A 


G 


T 


G 


A 


G 


T 


A G 


A 


A 


G 


T 


G 


T 


A A 


T 


T 


946 


- 


TAT 


T 


G 


A 


G 


T 


T 


T 


G 


T 


G T 


A 


A 


A 


G 


A 


A 


G A 


T 


A 


947 


- 


T T T 


A 


T 


A 


G 


T 


T 


G 


T 


T 


G T 


G 


T 


T 


G 


A 


A 


A G 


T 


T 


948 


- 


A T G 


A 


A 


A 


T 


A 


T 


G 


A 


T 


T G 


T 


G 


T 


T 


T 


G 


T T 


G 


T 


949 


- 


AAA 


G 


A 


G 


A 


T 


G 


T 


A 


A 


A G 


T 


G 


A 


G 


T 


T 


A T 


T 


A 


950 


- 


T T G 


A 


A 


G 


A 


A 


A 


G 


T 


T 


A G 


A 


T 


G 


A 


T 


G 


A A 


T 


T 


951 


- 


A T G 


T 


T 


A 


T 


T 


T 


G 


T 


T 


T A 


G 


T 


T 


T 


G 


T 


G T 


G 


A 


952 


- 


AAA 


T 


A 


T 


G 


A 


A 


T 


T 


T 


G A 


A 


G 


A 


G 


A 


A 


G T 


G 


A 


953 


- 


GAT 


T 


A 


G 


A 


T 


A 


T 


A 


G 


A A 


T 


A 


T 


T 


G 


A 


A G 


A 


G 


954 


- 


T T A 


G 


A 


A 


T 


A 


A 


G 


A 


G 


AAA 


T 


G 


T 


A 


T 


G T 


G 


T 


955 


- 


T T T 


A 


T 


G 


A A 


A 


G 


A 


G 


A A 


G 


T 


G 


T 


A 


T 


T A 


T 


G 


956 


- 


G T A 


A 


G 


T 


A 


T 


T 


A A 


G 


T G 


T 


G 


A 


T 


T 


T 


A G 


T 


A 


957 


- 


ATA 


A 


A 


G 


A 


G 


A 


A 


G 


T 


A A 


A 


G 


A 


G 


T 


A 


A A 


G 


T 


958 


- 


ATT 


G 


T 


T 


A 


A 


T 


T 


G 


A 


A G 


T 


G 


T 


A 


T 


G 


A A 


A 


G 


959 




TAT 


A 


T 


A 


G 


T 


T 


G 


A 


G 


T T 


G 


A 


G 


T 


A 


A 


G A 


T 


T 


960 


- 


TAG 


A 


T 


G 


A 


G 


A 


T 


A 


T 


A T 


G 


A 


A 


A 


G 


A 


T A 


G 


T 


961 


- 


ATA 


A 


G 


A 


A 


G 


A 


T 


G 


A 


T T 


T 


G 


T 


G 


T 


A 


A A 


T 


G 


962 


- 


T T A 


G 


T 


A 


A 


T 


A 


A 


G 


A 


A A 


G 


A 


T 


G 


A 


A 


G A 


G 


A 


963 


- 


GAT 


T 


T 


G 


T 


G 


A 


G 


T 


A 


A A 


G 


T 


A A 


A 


T 


A G 


A 


A 


964 


- 


AAA 


T 


A 


G 


A 


T 


G 


T 


A 


G 


A A 


T 


T 


T 


G 


T 


G 


T G 


T 


T 


965 




G A A 


A 


T 


T 


A 


G 


T 


G 


T 


T 


T G 


T 


G 


T 


G 


T 


A 


T T 


A 


T 


966 




ATT 


T 


G 


A 


G 


T 


A 


T 


G 


A 


T A 


G 


A 


A 


G 


A 


T 


T G 


T 


T 


967 




ATA 


G 


A 


G 


T 


T 


G 


A 


A 


G 


T A 


T 


G 


T 


A 


A 


A 


G T 


T 


T 


968 




T A A 


T 


T 


T 


G 


T 


G 


A 


A 


T 


G T 


T 


G 


T 


T 


A 


T 


T G 


T 


G 


969 




T T A 


G 


T 


T 


T 


A 


T 


G 


A 


G 


A G 


T 


G 


A 


G 


A 


T 


T T 


A 


A 


970 • 




G T T 


G 


T 


T 


A 


G 


A 


G 


T 


G 


T T 


T 


A 


T 


G 


A 


A 


A T 


T 


T 


971 




T T T 


A 


T 


T 


G 


T 


G 


A 


T 


G 


T G 


A 


A 


A 


T 


A 


A 


G A 


G 


A 


972 




G T A 


A 


G 


T 


A 


A 


T 


A 


T 


G 


A T 


A 


G 


T 


G 


A 


T 


T A 


A 


G 


973 




T G A 


G 


A 


T 


G 


A 


T 


G 


T 


A 


T A 


T 


G 


T 


A 


G 


T 


A A 


T 


A 


974 





WO 02/059355 
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Table I 



Sequence 




SBQ 


ID NO: 


No. in 
Ex 1 


AATTGAGAAAGAGATA 


A A T G 


A T A G 


975 


85 


TTTGAAGTGATGTTAG 


A A T G 


T T T A 


976 


- 


AGTTGTTGTGTAATTG 


T T A G 


T A A A 


977 


- 


ATAGTGAGAAGTGATA 


A G A T 


A T T T 


978 


- 


GTGTGATAAGTAATTGAGTTAAAT 


979 


- 


TAGTTATTGTTTGTGA 


A T T T 


GAGA 


980 


- 


ATAGTTGAATAGTAAT 


T T G A 


A G A G 


981 


- 


ATGTTTGTGTTTGAAT 


A G A G 


•A A T A 


982 


- 


TGATAAAGATATGAGA 


G A T T 


G T A A 


983 


_ 


TAAAGATGAGATGTTG 


T T A A 


A G T T 


984 


_ 


AAGTGAAATTTGTAAG 


A A T T 


A G T G 


985 


- 


GAAATGAGAGTTATTG 


A T A G 


T T T A 


986 


- 


TTTGTAAATGAGATAT 


A G T G 


T T A G 


987 




GTTAATTGTGATATTT 


G A T T 


A G T G 


988 


_ 


AGAGTGTTGATAAAGA 


T G T T 


TATA 


989 


_ 


AATTGTGAGAAATTGA 


T A A G 


A A G A 


990 




TTAAAGAGAATTGAGA 


A G A G 


A A A T 


991 


_ 


TTGTTAGAAGAATTGA 


A T G T 


A T G T 


992 




AGTTAAGATATGTGTGATGT 


T T A A 


993 


_ 


TGAGTTATGTTGTAAT 


A G A A 


A T T G 


994 




TTAGATAAGTTTAGAG 


A T T G 


A G A A 


995 




ATGAGTAATAAGAGTA 


T T T G 


A A G T 


996 


_ 


TGTTTAAGTGTAATGA 


T T T G 


T T A G 


997 


_ 


TTGAAGAAGATTGTTA 


T T G T 


T G A A 


998 




TATAGAAAGATTAAAG 


A G T G 


A A T G 


999 




TAAATT'GTTAGAAATT 


T G A G 


T G T G 


1000 




ATTGTTAGTGTGTTAT 


T G A T 


T A T G 


1001 




GAGAATTATGTGTGAA 


TATA 


G A A A 


1002 




TTGATTGATAAAGTAA 


A G A G 


T G T A 


1003 




GTGTGTAAATTGAATA 


T G T T 


A A T G 


1004 




AAAGTAAAGAAAGAAG 


T T T G 


A A A G 


1005 


- 


TTTAGTTGAAGAATAG 


AAAGAAAG 


1006 


- 


GTGTAATAAGAGTGAA 


T A G T 


A A T T 


1007 


_ 


TATTGAAATAAGAGAG 


A T T T 


G T G A 


1008 


_ 


A T G A G A. A AGAAGAAGT 


T A A G 


A T T T 


1009 


- 


AAGAGTGAGTATATTG 


T T A A 


A G A A 


1010 




TTTGTAAAGTGATGAT-GTAA 


G A T A 


1011 




GATGTTATGTGATGAA 


A T A T 


G T A T 


1012 


- 


GTAGAATAAAGTGTTA 


A A G T 


G T T A 


1013 


- 


AAAGAGTATGTGTGTA 


T G A T 


A T T T 


1014 


- 


AAAGATAAGAGTTAGT 


A A A T 


T G T G 


1015 


- 


AAGAATTAGAGAATAA 


G T G T 


G A T A 


1016 . 


- 


GATAAGAAAGTGAAAT 


G T A A 


A T T G 


1017 


86 


GATGAAAGATGTTTAA 


A G T T 


T G T T 


1018 


- 


A G T G T A AGTAATAAGT 


T T G A 


G A A A 


1019 


- 


GTTGAGAATTAGAATT 


T G A T 


A A A G 


1020 


87 


TTAAGAAATTTGTATG 


T G T T 


G T T G 


1021 


- 


AGAAGATTTAGATGAA 


A T G A 


G T T T 


1022 




TAAGTTTGAGATAAAG 


A T G A 


T A T G 


1023 




TGAGATAGTTTGTAAT 


A T G T 


T T G T 


1024 




AGTTTGAAATTGTAAG 


T T T G 


A T G A 


1025 




TAGAATTGATTAATGA 


T G A G 


T A G T 


1026 




AGAGATTTGTAATAAG 


T A T T 


G A A G 


1027 




ATAATGATGTAATGTA 


A G T A 


G T G T 


1028 




TGAAATTTGATGAGAG 


A T A T 


G T T A 


1029 




TGTGTAAAGTATAGTT 


T A T G 


T T A G 


1030 




TGAATAAGTGAAATAG 


A A T G 


A A T G 


1031 





WO 02/059355 



PCT/CA02/00089 
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Table I 

Sequence SEQ ID NO: No. in 

Ex 1 



A 


A A 


G A A 


A 


G 


A 


T 


T 


G 


T 


A 


A 


T 


A 


A 


G 


T 


A 


G 


A 


G 


1032 


- 


A 


A T 


G A 


A 


A 


T 


A 


G 


T 


G 


T 


T 


A 


A 


A 


T 


G 


A 


G 


T 


G 


T 


1033 


89 


G 


T A 


G A 


T 


A 


A 


A 


G 


A 


T 


G 


T 


G 


A 


A 


T 


T 


A 


T 


G 


A 


T 


1034 


- 


G 


A T 


A G 


T 


A 


T 


A 


T 


G 


T 


G 


T 


G 


T 


A 


T 


T 


T 


G 


T 


T 


T 


1035 


- 


A 


T G 


T T 


T 


G 


T 


A 


G 


A 


A 


A 


T 


G 


T 


T 


T 


G 


A 


A 


G 


A 


T 


1036 


- 


A 


A A 


T T 


T 


G 


T 


A 


G 


A 


G 


A 


G 


A 


A 


A 


T 


T 


T 


G 


T 


T 


G 


1037 


- 


T 


A G 


A A 


T 


A A 


G 


A 


T 


T 


A 


G 


T 


A 


A 


G 


T 


G 


T 


A 


G 


A 


1038 


- 


T 


G A 


T T 


T 


A 


G 


A 


G 


A 


A 


A 


T 


A 


T 


G 


A 


G 


T 


A 


G 


A 


A 


1039 


- 


A 


A T 


A G 


A 


G 


T 


A 


T 


G 


T 


T 


G 


T 


T 


T 


A 


T 


G 


A 


G 


A 


A 


1040 


- 


G 


A T 


G A 


T 


G 


A 


A 


G 


A 


G 


T 


T 


T 


A 


T 


T 


G 


T 


A 


A 


A 


T 


1041 


- 


A 


A G 


T A 


A 


A 


G 


A 


A 


G 


A 


A 


G 


A 


A 


A 


T 


G 


T 


G 


T 


T 


A 


1042 


- 


T 


T G 


A A 


G 


A 


A 


T 


T 


A 


A 


G 


T 


G 


T 


T 


T 


A 


G 


T 


G 


T 


A 


1043 


- 


A 


G A 


A A 


G 


A A 


T 


G 


T 


T 


G 


A 


T 


T 


T 


A 


T 


G 


A 


T 


G 


T 


1044 


- 


G 


A T 


T A A A 


,G 


A 


G 


A 


T 


G 


T 


T 


G 


A 


T 


T 


G 


A 


A 


A 


T 


1045 


- 


A 


A T 


G A 


T 


A 


A 


T 


T 


G 


T 


T 


G 


A 


G 


A 


G 


A 


G 


T 


A 


A 


T 


1046 


- 


G 


T T 


T G 


T 


T 


G 


A 


A 


A 


G 


T 


G 


T 


A 


A 


A 


G 


T 


A 


T 


A 


T 


1047 


90 


T 


G A 


G T 


T 


A 


T 


A 


T 


G 


A 


G 


A 


A 


A 


G 


T 


G 


T 


A 


A 


T 


T 


1048 


- 


T 


T G 


T G 


A 


G 


A 


A 


A 


G 


A 


A 


G 


T 


A 


T 


A 


T 


A 


G 


A 


A 


T 


1049 


- 


G 


T A 


A G 


T 


T 


T 


A 


G 


A 


G 


T 


T 


A 


T 


A 


G 


A 


G 


T 


T 


T 


A 


1050 


- 


G 


A T 


A G 


A 


T 


A 


G 


A 


T 


A 


A 


G 


T 


T 


A 


A 


T 


T 


G 


A 


A 


G 


1051 


- 


A 


G A 


G A 


T 


G 


A 


T 


T 


G 


T 


T 


T 


A 


T 


G 


T 


A 


T 


T 


A 


T 


G 


1052 


- 


A 


A A 


G T 


T 


A 


A 


G 


A 


A 


A 


T 


T 


G 


T 


A 


G 


T 


G 


A 


T 


A 


G 


1053 


- 


T 


T T 


G A 


T 


A 


T 


T 


G 


T 


T 


T 


G 


T 


G 


A 


G 


T 


G 


T 


A 


T 


A 


1054 


- 


A 


T T 


T G 


T 


A 


G 


A 


A 


A 


G 


T 


T 


G 


T 


T 


A 


T 


G A 


G 


T 


T 


1055 


- 


G 


A T 


T T 


G 


A 


G 


T 


A 


A 


G 


T 


T 


T 


A 


T 


A 


G 


A 


T 


G 


A 


A 


1056 


- 


A 


A G 


A T 


A 


A A 


G 


T 


G 


A 


G 


T 


T 


G 


A 


T 


T 


T 


A 


G A 


T 


1057 


- 


G 


A T 


A T 


T 


G 


T 


A 


A 


G 


A 


T 


A 


T 


G 


T 


T 


G 


T 


A 


A 


A 


G 


1058 


- 


G 


T A 


A G 


A 


G 


T 


G 


T 


A 


T 


T 


G 


T 


A 


A 


G 


T 


T 


A 


A 


T 


T 


1059 


- 


G 


T G 


T G 


A 


T 


T 


A 


G 


T 


A 


A 


T 


G 


A 


A 


G 


T 


A 


T 


T 


T 


A 


1060 


91 


G 


T A 


A G 


A 


A 


A 


G 


A 


T 


T 


A 


A. 


G 


T 


G 


T 


T 


A 


G 


T 


A 


A 


1061 


- 


A 


G T 


A G 


A 


A 


A 


G 


T 


T 


G 


A 


A 


A 


T 


T 


G 


A 


T 


T 


A 


T 


G 


1062 


92 


T 


A A 


G A 


G 


A 


A 


G 


T 


T 


G 


A 


G 


T 


A 


A 


T 


G 


T 


A 


T 


T 


T 


1063 


- 


G 


T T 


A A 


G 


A 


A 


A 


T 


A 


G 


T 


A 


G 


A 


T 


A 


A 


G 


T 


G 


A 


A • 


1064 


- 


T 


A A 


G T 


A 


A 


A 


T 


T 


G 


A 


A 


A 


G 


T 


G 


T 


A 


T 


A 


G 


T 


G 


1065 


- 


A 


A G 


A T 


G 


T 


A 


T 


G 


T 


T 


T 


A 


T 


T 


G 


T 


T 


G 


T 


G 


T 


A 


1066 


- 


A 


T T 


T A 


G 


A 


A 


T 


A 


T 


A 


G 


T 


G 


A 


A 


G 


A 


G 


A 


T 


A 


G 


1067 


- 


G 


T T 


A T 


G 


A A 


A 


G 


A 


G 


T 


A 


T 


G 


T 


G 


T 


T A A A 


T 


1068 


93 


T 


A T 


T A 


T 


G 


T 


G 


A 


A 


G 


A 


A 


G 


A 


A 


T 


G 


A 


T 


T 


A 


G 


1069 


- 


T 


A A 


T A 


A 


G 


T 


T 


G 


A 


A 


G 


A 


G 


A 


A 


T 


T 


G 


T 


T 


G 


T 


1070 


- 


T 


G A 


T G 


T 


T 


T 


G 


A 


T 


G 


T 


A 


A 


T 


T 


G 


T 


T 


A 


A 


A 


G 


1071 


- 


G 


T G 


A A 


A 


G 


A 


T 


T 


T 


G 


A 


G 


T 


T 


T 


G 


T 


A 


T 


A 


A 


T 


1072 


- 


A 


G A 


G A 


A 


T 


A 


T 


A 


G 


A 


T 


T 


G 


A 


G 


A 


T 


T 


T 


G 


T 


T 


1073 


- 


T 


T T 


G A 


G 


A 


T 


G 


T 


G 


A 


T 


G 


A 


T 


A 


A 


A 


G 


T 


T 


A 


A 


1074 


- 


G 


T T 


G T 


A 


A 


A 


T 


T 


G 


T 


A 


G 


T 


A 


A 


A 


G 


A 


A 


G 


T 


A 


1075 


94 


G 


T G 


T T 


A 


T 


G 


A 


T 


G 


T 


T 


G 


T 


T 


T 


G 


T 


A 


T 


T 


A 


T 


1076 


- 


A 


T T 


A T 


T 


G 


T 


G 


T 


A 


G 


A 


T 


G 


T 


A 


T 


T 


A 


A 


G 


A 


G 


1077 




G 


T T 


A G 


A 


A 


A 


G 


A 


T 


T 


T 


A 


G 


A 


A 


G 


T 


T 


A 


G 


T 


T 


1078 




T 


T G 


T G 


T 


A 


T 


T 


A 


A 


G 


A 


G 


A 


G 


T 


G 


A 


A 


A 


T 


A 


T 


1079 




G 


T T 


T A A 


G 


A 


T 


A 


G 


A 


A 


A 


G 


A 


G 


T 


G 


A 


T 


T 


T 


A 


1080 




A A T 


G A 


G 


A 


A 


A 


T 


A 


G 


A 


T A 


G 


T 


T 


A 


T 


T 


G 


T 


G 


1081 




T 


G A A T 


T 


G 


A 


A 


T 


A 


A 


G 


A A 


T 


T 


T 


G 


T 


T 


G 


T 


G 


. 1082 


95 


A 


A T 


A A 


G 


A 


T 


T 


G 


A 


A 


T 


T 


A 


G 


T 


G 


A 


G 


T 


A A 


G 


' 1083 




A 


A T 


G T 


T 


T 


G 


A 


G 


A 


G 


A 


T 


T 


T 


A 


G 


T 


A 


A 


A 


G 


A 


1084 




A 


G T 


T T 


A 


G 


A 


A 


T 


A 


G 


A 


A 


A 


T 


G 


T 


G 


T 


T 


T 


G 


A 


1085 




T 


A T 


A A 


G 


T 


A 


A 


G 


T 


G 


T 


T 


A 


A 


G 


A 


T 


T 


T 


G 


A 


G 


1086 




G 


T A 


G T 


G 


A 


A 


T 


A 


A 


G 


T 


T 


A 


G 


T 


G 


T 


T 


A 


A 


T 


A 


1087 




A 


A G 


T G 


T 


G 


T 


T 


A 


A 


A 


G 


T 


A 


A 


A 


T 


G 


T 


A 


G 


A 


T 


1088 





WO 02/059355 



PCT/CA02/00089 



- 77 - 



Table I 

Sequence SEQ ID NO: No. in 

Ex 1 



A 


G 


A 


G 


A 


T 


G 


T 


T 


T 


A 


T 


G 


T 


T 


G 


T 


G 


A 


A 


T 


T 


A 


A 


1089 


- 


A 


G 


T 


T 


G 


A 


A 


T 


A 


T 


T 


G 


A 


T 


G 


A 


T 


A 


A. 


G 


A 


A 


G 


A 


1090 


- 


T 


G 


A 


A 


T 


G 


T 


G 


A 


G 


A 


T 


G 


T 


T 


T 


A 


G 


A 


A 


T 


A 


A 


T 


1091 


- 


A 


A 


T 


A 


A 


T 


G 


A 


T 


G 


T 


A 


A 


G 


T 


T 


T 


G 


A 


G 


T 


T 


T 


G 


1092 


- 


A 


A 


A 


G 


A 


G 


T 


G 


A 


A 


T 


A 


G 


A 


A 


A 


T 


A 


A 


G 


A 


G 


A 


A 


1093 


- 


A 


A 


T 


A 


A 


A 


G 


T 


T 


A 


T 


T 


G 


A 


G 


A 


G 


A 


G 


T 


T 


T 


A 


G 


1094 


- 


A 


G 


T 


A 


G 


T 


G 


T 


T 


G 


T 


A 


G 


T 


T 


T 


A 


G 


T 


A 


T 


A 


T 


A 


1095 


- 


G 


T 


A 


A 


G 


A 


A 


T 


G 


T 


A 


T 


T 


A 


G 


A 


T 


A 


T 


T 


T 


G 


T 


G 


1096 


- 


G 


A 


T 


A 


A 


A 


T 


G 


T 


T 


T 


G 


A 


T 


A 


A 


A 


G 


T 


A 


G 


T 


T 


G 


1097 


- 


A 


T 


A 


G 


T 


A 


T 


G 


T 


A 


T 


G 


T 


G 


T 


G 


A 


A 


G 


A 


T 


T 


T 


A 


1098 


- 


A 


T 


G 


A 


A 


T 


G 


T 


A 


G 


A 


G 


T 


G 


A 


T 


T 


A 


G 


T 


T 


T 


A 


A 


1099 


- 


G 


T 


A 


G 


T 


A 


T 


T 


T 


A 


G 


T 


G 


A 


T 


G 


T 


A 


A 


G 


A 


A 


T 


A 


1100 


- 


A 


G 


A 


A 


T 


T 


G 


T 


A 


T 


T 


G 


A 


A 


G 


A 


A 


G 


A 


A 


T 


A 


T 


G 


1101 


- 


T 


T 


T 


A 


T 


A 


G 


A 


A 


T 


T 


G 


A 


G 


A 


G 


A 


A 


G 


T 


T 


A 


A 


G 


1102 


- 


A 


A 


A 


G 


T 


A 


G 


T 


A 


G 


A 


G 


A 


T 


T 


T 


G 


A 


G 


A 


A 


T 


T 


A 


1103 


- 


T 


T 


T 


A 


A 


A 


G 


A 


A 


A 


G 


T 


A 


T 


T 


G 


T 


A 


A 


G 


A 


G 


T 


G 


1104 


- 


A 


A 


A 


T 


T 


G 


A 


G 


A 


A 


A 


G 


T 


G 


A 


A 


T 


G 


A 


A 


G 


T 


T 


T 


1105 


- 


A 


A 


G 


A 


A 


A 


T 


A 


A 


G 


T 


A 


T 


G 


A 


T 


A 


G 


T 


A 


G" 


T 


A 


G 


1106 


- 


A 


T 


T 


T 


G 


A 


A 


T 


T 


G 


T 


A 


T 


T 


G 


T 


A 


G 


T 


T 


T 


G 


T 


G 


1107 


- 


A 


A 


G 


A 


G 


A 


A 


T 


A 


A 


T 


G 


T 


A 


G 


A 


G 


A 


T 


A 


T 


A 


A 


G 


1108 


- 


T 


G 


T 


G 


T 


A 


A 


T 


A 


G 


T 


T 


G 


T 


T 


A 


A 


T 


G 


A 


G 


T 


A 


A 


1109 


- 


T 


A 


T 


A 


G 


T 


T 


G 


T 


A 


G 


T 


T 


T 


A 


G 


A 


T 


G 


A 


A 


T 


G 


T 


1110 


- 


A 


T 


T 


G 


T 


G 


T 


T 


A 


G 


A 


A 


T 


G 


A 


T 


G 


T 


T 


A 


A 


T 


A 


G 


1111 


- 


G 


T 


T 


T 


G 


T 


A 


T 


A 


G 


T 


A 


T 


T 


T 


G 


A 


T 


T 


G 


A 


T 


G 


T 


1112 


- 


A 


G 


A 


G 


T 


A 


A 


A 


G 


T 


A 


T 


G 


A 


G 


T 


T 


A 


T 


G 


A 


A 


T 


A 


1113 


- 


G 


A 


A 


A 


G 


T 


T 


T 


A 


A 


G 


T 


G 


A 


T 


G 


T 


A 


T 


A 


T 


T 


G 


T 


1114 


96 


T 


T 


A 


A 


A 


T 


G 


A 


T 


A 


A 


A 


G 


A 


G 


T 


A 


G 


T 


G 


A 


A 


G 


T 


1115 


- 


T 


T 


A 


A 


A 


T 


G 


T 


G 


T 


G 


A 


G 


A 


A 


G 


A 


T 


G 


A 


A 


T 


A 


A 


1116 


- 


A 


T 


T 


T 


G 


T 


A 


T 


A 


A 


A 


G 


T 


G 


A 


A 


G 


A 


A 


G 


A 


G 


A 


A 


1117 


97 


T 


G 


A 


T 


T 


A 


G 


T 


A 


T 


T 


T 


G 


T 


G 


A 


A 


G 


A 


G 


A 


T 


T 


T 


1118 


- 


T 


T 


T 


G 


A 


A 


T 


G 


A 


A 


A 


T 


T 


G 


A 


T 


G 


A 


T 


A 


G 


A 


T 


G 


1119 


- 


A 


G 


A 


G 


T 


A 


A 


G 


A 


T 


T 


A 


A 


G 


A 


A 


T 


A 


A 


G 


A 


A 


A 


G 


1120 


- 


A 


T 


T 


G 


A 


A 


T 


T 


G 


A 


G 


A 


A 


G 


T 


G 


A 


A 


G 


T 


A 


A 


A 


T 


1121 


- 


T 


T 


T 


A 


G 


A 


G 


A 


A 


G 


T 


A 


T 


T 


G 


T 


T 


T 


G 


A 


A 


A 


G 


A 


1122 


- 


T 


A 


A 


A 


G 


T 


G 


A 


A 


A 


G 


A 


T 


T 


T 


G 


A 


A 


A 


T 


G 


A 


T 


G 


1123 


- 


G 


A 


A 


A 


G 


T 


T 


A 


G 


A 


G 


A 


A 


A 


T 


G 


T 


A 


G 


A 


A 


A 


T 


T 


1124 


- 


G 


T 


G 


A 


A 


T 


A 


A 


T 


G 


A 


A 


G 


A 


A 


G 


T 


T 


A 


T 


G 


T 


T 


A 


1125 


98 


T 


T 


G 


T 


G 


A 


A 


T 


A 


A 


A 


G 


T 


A 


G 


A 


T 


G 


T 


G 


T 


T 


A 


T 


1126 


- 


T 


T 


A 


T 


A 


T 


G 


A 


T 


A 


T 


G 


A 


G 


T 


T 


T 


G 


T 


G 


T 


T 


G 


A 


1127 


- 


T 


T 


G 


A 


T 


T 


T 


G 


T 


G 


T 


G 


A 


G 


T 


A 


T 


T 


A 


G 


T 


T 


A 


T 


1128 


- 


A 


A 


A 


G 


T 


G 


A 


T 


T 


A 


A 


G 


T 


T 


A 


G 


T 


T 


T 


G 


A 


G 


A 


T 


1129 


- 


T 


T 


G 


T 


A 


T 


T 


•T 


G 


T 


A 


T 


A 


A 


T 


G 


T 


T 


G 


A 


A 


G 


A 


G 


1130 


- 


G 


T 


T 


T 


G 


A 


A 


A 


T 


T 


A 


G 


T 


G 


T 


G 


A 


G 


A 


A 


A 


T 


A 


T 


1131 




A 


A 


T 


G 


T 


T 


G 


A 


G 


A 


T 


T 


G 


A 


T 


A 


A 


T 


G 


T 


T 


G 


A 


A 


1132 


- 


T 


A 


G 


T 


A 


G 


T 


A 


G 


T 


A 


T 


T' 


G 


T 


T 


G 


T 


A 


A 


T 


A 


A 


G 


1133 


- 


G 


T 


T 


G 


T 


A 


A 


T 


T 


T 


G 


A 


G 


T 


G 


T 


T 


A 


G 


T 


T 


A 


T 


T 


1134 




T 


G 


A 


A 


T 


A 


T 


G 


A 


T 


A 


G 


T 


T 


A 


G 


T 


A 


A 


T 


T 


G 


T 


G 


1135 




T 


G 


A 


T 


A 


G 


T 


A 


T 


G 


T 


T 


T 


G 


T 


G 


A 


T 


T 


A 


A 


A 


G 


A 


1136 




G 


A 


T 


G 


T 


A 


T 


A 


A 


A 


G 


A 


G 


T 


A 


T 


G 


T 


T 


A 


T 


A 


A 


G 


1137 




A 


G 


T 


G 


A 


G 


A 


T 


T 


T 


A 


G 


A 


A 


G 


A 


T 


G 


T 


T 


A 


T 


T 


A 


1138 




A 


T 


G 


A 


G 


A 


A 


T 


T 


T 


G 


T 


T 


A 


A 


A 


G 


A 


G 


A 


A 


A 


G 


T 


1139 




A 


A 


A 


G 


A 


A 


T 


T 


A 


G 


T 


A 


T 


G 


A 


T 


A 


G 


A 


T 


G 


A 


G 


A 


1140 


99 


T 


A 


G 


A 


G 


T 


T 


G 


T 


A 


T 


A 


G 


T 


T 


T 


A 


T 


A 


G 


.T 


T 


G 


A 


1141 




G 


T 


A 


G 


A 


A 


T 


G 


A 


T 


T 


G 


T 


T 


T 


A 


G 


A 


A 


G 


A 


T 


T 


T 


1142 




G 


T 


T 


T 


A 


T 


G 


T 


T 


T 


G 


A 


G 


A 


A 


G 


A 


G 


T 


T 


A 


T 


T 


T 


1143 




T 


A 


G 


A 


A 


G 


T 


T 


T 


G 


A 


A 


A 


G 


T 


T 


A 


T 


T 


G 


A 


T 


T 


G 


1144 




G 


A 


T 


G 


A 


A 


G 


A 


G 


T 


A 


T 


T 


T 


G 


T 


T 


A 


T 


A 


T 


G 


T 


A 


1145 





WO 02/059355 



PCT/CA02/00089 



- 78 - 



Table I 

Sequence * SEQ ID NO: No. in 
Ex 1 



G 


A 


T 


G 


A 


A 


T 


A 


T 


A 


G 


T 


A 


A 


G 


T 


A 


T 


T 


G 


A 


G 


T 


A 


1146 


100 


T 


A 


G 


T 


G 


A 


T 


G 


A 


A 


A 


T 


T 


T 


G 


A 


G 


A 


T 


A 


G 


A 


T 


A 


1147 


- 


G 


A 


A 


A 


G 


A 


A 


A 


T 


T 


G 


A 


A 


G 


A 


G 


T 


T 


T 


G 


A 


T 


A 


T 


114? 


- 


A 


T 


T 


T 


G 


A 


G 


T 


A 


T 


T 


T 


G 


T 


G 


T 


A 


T 


T 


G 


A 


A 


T 


G 


1149 


- 


A 


T 


G 


A 


G 


T 


T 


G 


A 


A 


A 


T 


T 


T 


G 


A 


A 


G 


T 


A 


T 


T 


G 


T 


1150 


- 


T 


T 


A 


A 


T 


A 


G 


T 


G 


A 


G 


A 
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All references referred to in this specification are incorporated 
herein by reference. 
5 The scope of protection sought for the invention described herein 

is defined by the appended claims. It will also be understood that any 
elements recited above or in .the claims, can be combined with the 
elements of any claim. In particular, elements of a dependent claim can 
be combined with any element of a claim from which it depends, or with 
10 any other compatible element of the invention. 

This application claims priority from United States Provisional 
Patent Application Nos. 60/263,710 and 60/303,799, filed January 25, 
2001 and July 10, 2001. Both of these documents are incorporated herein 
by reference. 
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CLAIMS- 

1 . A composition comprising molecules for use as tags or tag 
complements wherein each molecule comprises an oligonucleotide selected 
from a set of oligonucleotides based on a following group of sequences, 
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wherein each of 1 to 3 is a nucleotide base selected to be different 
from the others of 1 to 3 with the proviso that up to three nucleotide 
bases of each sequence can be substituted with any nucleotide base 
provided that: 

for any pair of sequences of the set: 

Ml £ 15, M2 <; 12, M3 £ 19, M4 £ 15, and M5 <, 18, where: 
Ml is the maximum number of matches for any alignment in which there 
are no internal indels; 

M2 is the maximum length of a block of matches for any alignment; 

M3 is the maximum number of matches for any alignment having a maximum 

score; 

M4 is the maximum sum of the lengths of the longest two blocks of 
matches for any alignment of maximum score; and 

M5 is the maximum sum of the lengths of all the blocks of matches 
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having a length of at least 3, for any alignment of maximum score ; 
wherein: 

the score of an alignment is determined according to the equation (A 
x m) - (B x mm) - (C x {og + eg) ) - (D x eg)), wherein: 



A is the total number of matched pairs of bases in the alignment 
B is the total number of internal mismatched pairs in the 
alignment; 

C is the total number of internal gaps in the alignment; and 
D is the total number of internal indels in the alignment minus 
the total number of internal gaps in the alignment; and 
wherein the maximum score is determined separately for each of -(i) , 



2 . A composition comprising molecules for use as tags or tag 
complements wherein each molecule comprises an oligonucleotide selected 
from a set of oligonucleotides based on a group of sequences having the 
numeric pattern of sequences set out in claim 1, wfierein each of 1 to 3 
is a nucleotide base selected to be different from the others of 1 to 3 
with the proviso that up to three nucleotide bases of each sequence can 
be substituted with any nucleotide b^se provided that: 
for any pair of sequences of the set: 

Ml £ 18, M2 <5 16, M3 <; 20, M4 Z 17, and M5 <; 19, where: 

Ml is the maximum number of matches for any alignment in which there 

are no internal indels; 

M2 is the maximum length of a block of matches for any alignment; 

M3 is the maximum number of matches for any alignment having a maximum 

score; 

M4 is the maximum sum of the lengths of the longest two blocks of 
matches for any alignment of maximum score; and 

M5 is the maximum sum of the lengths of all the blocks of matches 
having a length of at least 3, for any alignment of maximum score; 
wherein 



the score of an alignment is determined according to the equation (A 
x m) - (B x nun) - (C x (og + eg)) - (D x eg)), wherein: 
for each of (i) to (iv) : 



for each of (i) to (iv) : 



(i) m = 6, mm = 6, 6g 

(ii) m. = 6, mm » 6, og 

(iii) m « 6, mm « 2, og 

(iv) m 6, mm = 6, og 



0 and eg = 6, 
5 and eg = 1, 



5 and egr = 1, and 
6. and eg m 0, 



(ii) , (iii) and (iv) . 
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(i) m = 6, mm m 6, og ^ 0 and eg = 6, 

(ii) m = 6, mm = 6, og ~ 5 and eg = 1, 

(iii) m = 6, nun = 2, og = 5 and eg = 1, and 

(iv) m - 6, mm =6, og = 6 and eg = 0 , 

A is the total number of matched pairB of bases in the alignment; 
B is the total number of internal mismatched pairs in the 
alignment; 

C is the total number of internal gaps in the alignment; and 
D is the total number of internal indels in the alignment minus 
the total number of internal gaps in the alignment; and 

wherein the maximum score is determined separately for each of (i) , 

(ii) , (iii) and (iv) . 

3 . A composition comprising molecules for use as tags or tag 
complements wherein each molecule comprises an oligonucleotide selected 
from a set of oligonucleotides based on a group of sequences having the 
5 numeric patterns set out in claim 1, wherein each of 1 to 3 is a 

nucleotide base selected to be different from the others of 1 to 3 with 
the proviso that up to three nucleotide bases of each sequence can be 
substituted with any nucleotide base provided that: 
for any pair of sequences of the set: 

Ml s 18, M2 z 16, M3 s 20, M4 <; 17, and M5 <; 19, where: 
Ml is the maximum number of matches for any alignment in which there 
are no internal indels; 

M2 is the maximum length of a block of matches for any alignment; 

M3 is the maximum number of matches for any alignment having a maximum 

score; 

M4 is the maximum sum of the lengths of- the longest two blocks of 
matches for any alignment of maximum score; and 

M5 is the maximum sum of the lengths of all the blocks of matches 
having a length of at least 3, for any alignment of maximum score, 
wherein: 

the score of an alignment is determined according to the equation 3A 
- B - 3C - D, wherein: 

A is the total number of matched pairs of bases in the alignment; 
. B is the total number of internal mismatched pairs in the 

alignment; 

C iB the total number of internal gaps in the alignment; and 
D is the total number of internal indels in the alignment minus 
the total number of internal gaps in the alignment; and 



WO 02/059355 



- 101 - 



PCT/CA02/00089 



4. A composition according to any preceding claim, wherein for the 
group of 24mer sequences in which 1 = A, 2 = T and 3 = G, under a 
defined set of conditions in which the maximum degree of hybridization 

5 between a sequence and any complement of a different sequence of the 
group of 24mer sequences does not exceed 30% of the degree of 
hybridization between said sequence and its complement, for all said 
oligonucleotides of the composition, the maximum degree of hybridization 
between an oligonucleotide and a complement of any other oligonucleotide 
10 of the composition does not exceed 50% of the degree of hybridization of 
the oligonucleotide and its complement. 

5. The composition of claim 4, wherein said maximum degree of 
hybridization between a sequence and any complement of a different 

15 sequence does .not exceed 30% of the degree of hybridization between said 
sequence and its complement, the degree of hybridization between each 
sequence and its complement varies by a factor of between 1 and up to 
10, more preferably between 1 and up to 9, more preferably between 1 and 
up to 8, more preferably between 1 and up to 7, more preferably between 

20 1 and up to 6, and more preferably between 1 and up to 5 . 

6. The composition of claim 4 or 5, wherein the maximum degree of 
hybridization between a sequence and any complement of a different 
sequence does not exceed 25%, more preferably does not exceed 20%, more 

25 preferably does not exceed 15%, more preferably does not exceed 10%, 
more preferably does not exceed 5%'. 

7. The composition of any of claims 4 to 6, wherein said defined set of 
conditions results in a level of hybridization that is the same as the 

30 level of hybridization obtained when hybridization conditions include 
0.2 M NaCl, 0.1 M Tris, 0.08% Triton X-100, pH 8.0 at 37°C. 

8. The composition of any of claims 4 to 7, said defined set of 
conditions includes the group of 24mer sequences being covalently linked 

35 to beads. 

9. The composition of claim 4 wherein, for the group of 24mers the 
maximum degree of hybridization between a sequence and any complement of 
a different sequence does not exceed 15% of the degree of hybridization 

40 between said sequence and its complement and the degree of hybridization 
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between each sequence and its complement varies by a factor of between 1 
and up to 9, and for. all oligonucleotides of the set, the maximum degree 
of hybridization between an oligonucleotide and a complement of any 
other oligonucleotide of the set does not exceed 20% of the degree of 
hybridization of the oligonucleotide and its complement. 

10. The composition of any preceding claim wherein: each 1 is one of 
A, T/U, G and C; each 2 is one of A, T/U, G and C; and each 3 is one of 
A, T/U, G and C; and each of 1, 2 and 3 is selected so as to be 
different from all of the others of 1, 2 and 3. 

11. The composition of claim 10, wherein 1 is A or T/U, 2 is A or T/U 
and 3 is G or C. 

12. The composition of claim 11, wherein 1 is A, 2 is T/U and 3 is G. 

13 . The composition of any preceding claim, wherein each said 
oligonucleotide is from twenty- two to twenty-six bases in length, or 
from twenty-three to twenty-five. 

14. The composition of any of claims 1 to 13, wherein each said 
oligonucleotide is of the same length as every other said 
oligonucleotide. 

15. The composition of claim 14, wherein each said oligonucleotide is 
twenty-four bases in length. 

16. The composition of any of claims 1 to 13 wherein no said 
oligonucleotide contain? more than four contiguous bases that are 
identical to each other. 

17. The composition of any of claims 1 to 12 and 16 wherein the number 
of G's in each said oligonucleotide does not exceed L/4 where L is the 
number of bases in said sequence. 

o 

18. The composition of any of claims 1 to 14 and 16 and 17, wherein the 
number of G's in each said oligonucleotide does not vary from the 
average number of G's in all of the said oligonucleotides by more than 
one. 
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19. The composition of any claims l to 14 and 16 to 18, wherein the 
number of G's in each said oligonucleotide is the same as every other 
said oligonucleotide. 

5 20. The composition of claim 19, wherein each said oligonucleotide is 
twenty-four bases in length and each said oligonucleotide contains 6 
G' s. 

21. The composition of any of claims 1 to 12 and claims 16 to 20, 

10 wherein, for each said nucleotide, there is at most six bases other than 
G between every pair of neighboring pairs of G's. 

22. The composition of any of claims 1 to 12 and claims 16 to 21, 
wherein at the 5' -end of each said oligonucleotide at least one of the 

15 first, second, third, fourth, fifth, sixth and seventh bases of the 
sequence of the oligonculeotide is a G. 

23. The composition of any of claims 1 to 12 and claims 16 to 21, 
wherein at the 3' Tend of each said oligonucleotide at least one of the 

20 first, second, third, fourth, fifth, sixth and seventh bases of the 
sequence of the oligonucleotide is a G. 

24. The composition of claim 22, wherein at the 3' -end of each said 
oligonucleotide at least one of the first, second, third, fourth, fifth, 

25 sixth and seventh bases of the sequence of the oligonucleotide is a G. 

25. The composition of any of claims 1 to 24, comprising ten, pr 
twenty, or thirty, or forty, or fifty, or sixty, or seventy, or eighty, 
or ninety, or one hundred, or one hundred and ten, or one hundred and 

30 twenty, or one hundred and thirty, or one hundred and forty, or one 

hundred and fity, or one hundred and sixty said molecules, or comprising 
one hundred and seventy said molecules, or comprising one hundred and 
eighty said molecules, or comprising one hundred and ninety said 
molecules, or comprising two hundred said molecules, or comprising two 

35 hundred and* twenty said molecules, or comprising two hundred, and forty 
said molecules, or comprising two hundred and sixty said molecules, or 
comprising two hundred and eighty said molecules, or comprising three 
hundred said molecules, or comprising four hundred said molecules, or 
comprising five hundred said molecules, or comprising six hundred said 

40 molecules, or comprising seven hundred said molecules, or comprising 
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eight hundred said molecules, or comprising nine hundred said molecules, 
or comprising one thousand said molecules, eleven hundred said 
molecules . 

5 26. A composition of any of claims 1 to 25, wherein each said molecule 
is linked to a solid phase support so as to be distinguishable from a 
mixture of other said molecules by hybridization to its complement. 

27. The composition of claim 26, wherein each molecule is linked to a 
10 defined location on a said solid phase support, the defined location for 

each said molecule being different than the defined location for 
different other said molecules. 

28. The composition of claim 27, wherein each said solid phase support 
15 is a microparticle and each said molecule is covalently linked to a 

different microparticle than each other different said molecule. 

29. A composition comprising a set of 150 molecules for use as tags or 
tag complements wherein each molecule comprises an oligonucleotide 

20 having a sequence of at least sixteen nucleotide bases wherein for any 
pair of sequences of the set: 

Ml > 19/24 x LI, M2 > 17/24 x LI, M3 > 21/24 X LI, M4 > 18/24 x LI, M5 > 
20/24 x LI, where LI is the length of the shortest sequence of the pair, 
where : 

Ml is the maximum number of matches for any alignment of the pair of 
sequences in which there are no internal indels; 

M2 is the maximum length of a block of matches for any alignment of the 
pair of sequences ; 

M3 is the maximum number of matches for any alignment of the pair of 
sequences having a maximum score; 

M4 is the maximum sum of the lengths of the longest two blocks of 
matches for any alignment of the pair of sequences of maximum score; 
and 

M5 is the maximum sum of the lengths of all the blocks of matches 
having a length of at least 3, for any alignment of the pair of 
sequences of maximum score, wherein: 

the score of an alignment is determined according to the equation (A 
x m) - (B x mm) - (C x (og + eg) ) - (D x eg) ) , wherein: 
for each of (i) to (iv) : 

(i) m = 6, mm = 6, og = 0 and eg = 6, 
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(ii) m = 6, nun = 6, og - 5 and eg = 1, 

(iii) m=6, mm=2, og=5 and eg = 1, and 

(iv) m = 6, mm = 6, og = 6 and eg = 0, 

A is the total number of matched pairs of bases in the alignment; 
B is the total number of internal mismatched pairs in the 
alignment; 

C is the total number of internal gaps in the alignment; and 
D is the total number of internal indels in the alignment minus 
the total number of internal gaps in the alignment; and 

wherein the maximum score is determined separately for each of (i) , 

(ii) , (iii) and (iv) . 

30. A composition comprising a set of 150 molecules for use as tags or 
tag complements wherein each molecule comprises an oligonucleotide 
having a sequence of at least sixteen nucleotide bases wherein for any 
5 pair of sequences of the set: 

Ml s 18, M2 £ 16, M3 i 20, M4 £ 17, and M5 <; 19, where: 
Ml is the maximum number of matches for any alignment of the pair of 
sequences in which there are no internal indels; 

M2 is the maximum length of a block of matches for any alignment of the 
pair of sequences; 

M3 is the maximum number of matches for any alignment of the pair of 
sequences having a maximum score; 

M4 is the maximum sum of the lengths of the longest two blocks of 
matches for any alignment of the pair of sequences of maximum score; 
and 

M5 is the maximum sum of the lengths of all the blocks of matches 
having a length of at least 3, for any alignment of the pair of 
sequences of maximum score, wherein: 

the score of a said alignment is determined according to the equation 
3A - B - 3C - D, wherein: 

A is the total number of matched pairs of bases in the alignment; 
B is the total number of internal mismatched pairs in the 
alignment; 

C is the total number of internal gaps in the alignment; and 
D is the total number of internal indels in the alignment minus 
the total number of internal gaps in the alignment. 



31. The composition of claim 29 or 30, wherein each said sequence has 
up to fifty bases. 
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32. The composition of claim 31, wherein each said sequence is between 
sixteen and forty bases in length, or between sixteen and thirty- five 
bases in length, or between eighteen and thirty bases in length, or 

5 between twenty and twenty-eight bases in length, or between twenty-one 
and twenty-seven bases in length, or between twenty-two and twenty-six 
bases in length. 

33. The composition of any of claims 29 to 32, wherein each said 
10 sequence is of the same length as every other said sequence. 

34. The composition of claim 33, wherein each said sequence is twenty- 
four bases in length. 

15 35. The composition of any of claims 29 to 34 wherein no said sequence 
contains more than four contiguous bases that are identical to each 
other . 

36. The composition of any of claims 29 to 35 wherein the number of G' s 
20 in each said sequence does not exceed L/4 where L is the number of bases 

in said sequence. 

37. The composition of claim 36, wherein the number of G's in each said 
sequence does not vary from the average number of G' s in all of the 

25 sequences of the set by more than one. 

38. The composition of claim 37, wherein the number of G's in each said 
sequence is the same as every other sequence of the set. 

30 39. The composition of claim 37, wherein each said sequence is twenty- 
four bases in length and each said sequence contains 6 G's. 

40. The composition of any of claims 29 to 39 wherein, for each said 
sequence, there is at most six bases other than G between every pair of 

35 neighboring pairs of G's. 

41. The composition of any of claims 29 to 40, wherein at the 5' -end of 
each said sequence at least one of the first, second, third, fourth, 
fifth, sixth and seventh bases of the sequence is a G. 

40 
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42. The composition of any of claims 29 to 40, wherein at the 3' -end of 
each said sequence at least one of the first., second, third, fourth, 
fifth, sixth and seventh bases of the sequence is a G. 

5 43. The composition of claim 41, wherein at the 3' -end of each said 

sequence at least one of the first, second, third, fourth, fifth, sixth 
and seventh bases of the sequence is a G. 

44. The composition of any of claims 29 to 43, wherein under a defined 
10 set of conditions, the maximum degree of hybridization between a said 

oligonucleotide and any complement of a different oligonucleotide of the 
composition does not exceed about 30% of the degree of hybridization 
between said oligonucleotide and its complement, more preferably 20%, 
more preferably 15%, more preferably 10%, more preferably 6%. 

15 

45. The composition of claim 44, wherein said set of conditions results 
in a level of hybridization that is the same as the level of 
hybridization obtained when hybridization conditions include 0.2 M NaCl, 
0.1 M Tris, 0.08% Triton X-100, pH 8.0 at 37°C, and the oligonucleotides 

20 are covalently linked to microparticles . 

46. The composition of claim 45, wherein under said defined set of 
conditions, the degree of hybridization between each oligonucleotide and 
its complement varies by a factor of between 1 and up to 8, more 

25 preferably up to 7, more preferably up to 6, more preferably .up to 5. 

47. The composition of any of claims 29 to 46, comprising one hundred 
and sixty said molecules, .or comprising one hundred and seventy said 
molecules, or comprising one hundred and eighty said molecules, or 

30 comprising one hundred and ninety said molecules, or comprising two 
hundred said molecules, or comprising two hundred and twenty said 
molecules, or comprising two hundred and forty said molecules, or 
comprising two hundred and sixty said molecules, or comprising two 
hundred and eighty said molecules, or comprising three hundred said 

35 molecules, or comprising four hundred said molecules, or comprising five 
hundred said molecules, or comprising six hundred said molecules, or 
comprising seven hundred said molecules, or comprising eight hundred 
said molecules, or comprising nine hundred said molecules, or comprising 
one thousand said molecules. 
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48. A composition of any of claims 29 to 47, wherein each said molecule 
is linked to a solid phase support so as to be distinguishable from a 
mixture of other said molecules by hybridization to its complement. 

5 49. The composition of claim 4 8, wherein each molecule is linked to a 
defined location on a said solid phase support, the defined location for 
each said molecule being different than the defined location for 
different other said molecules. 

10 50. The composition of claim 49, wherein each said solid phase support 
is a microparticle and each said molecule is covalently to a different 
microparticle than each other different said molecule. 

51. A composition comprising one hundred and fifty minimally cross- 
15 hybridizing molecules for use as tags or tag complements wherein each 

molecule comprises an oligonucleotide comprising a sequence of 
nucleotide bases for which, under a defined set of conditions, the 
maximum degree of hybridization between a said oligonucleotide and any 
complement of a different oligonucleotide does not exceed about 20% of 
20 the degree of hybridization between said oligonucleotide and its 
complement . 

52. The composition of claim 51, wherein each said sequence has between 
ten and fifty bases. 

25 

53. The composition of claim 52, wherein each said sequence is between 
sixteen and forty bases in length, or between sixteen and thirty-five 
bases in length, or between eighteen and thirty bases in length, or 
between twenty and twenty-eight bases in length, or between twenty-one 

3 0 and twenty-seven bases in length, or between twenty-two and twenty-six 
bases in length. 

54. The composition of any of claims 51 to 53, wherein each said 
sequence is of the same length as every other said sequence. 

35 

55. The composition of claim 54, wherein each said sequence is twenty- 
four bases in length. 
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56. The composition of any of claims 51 to 55 wherein no said sequence 
contains more than four contiguous bases that are identical to each 
other . 

5 57. The composition of any of claims 51 to 56 wherein the number of G's 
in each said sequence does not exceed L/4 where L is the number of bases 
in said sequence. 

58. The composition of claim 57, wherein the number of G's in each said 
10 sequence does not vary from the average number of G's in all of the 

sequences of the set by more than one. 

59. The composition of claim 58, wherein the number of G's in each said 
sequence is the same as every other sequence of the set. 

15 

60. The composition of claim 58, wherein each said sequence is twenty- 
four bases in length and each said sequence contains 6 G's. 

61. The composition of any of claims 51 to 60 wherein, for each said 

2 0 sequence, there is at most six bases other than G between every pair of 
neighboring pairs of G's. 

i 

62. The composition of any of claims 51 to 61, wherein at the 5' -end of 
each said sequence at least one of the first, second, third, fourth, 

25 fifth, sixth and seventh bases of the sequence is a G. 

63. The composition of any of claims 51 to 61, wherein at the 3' -end of 
each said sequence at least one of the first, second, third, fourth, 
fifth, sixth and seventh bases of the sequence is a G. 

30 

64. The composition of claim 62, wherein at the 3' -end of each said 
sequence at least one of the first, second, third, fourth, fifth, sixth 
and seventh bases of the sequence is a G, 

35 65. The composition of any of claims 61 to 64, wherein under a said 

defined set of conditions, the maximum degree of hybridization between a 
said oligonucleotide and any complement of a different oligonucleotide 
of the composition does not exceed about 15%, more preferably 10%, more 
preferably 6%. 

40 
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66. The composition of claim 65, wherein said set of conditions results 
in a level of hybridization that is the same as the level of 
hybridization obtained when hybridization conditions include 0.2 M NaCl, 
0.1 M Tris, 0.08% Triton X-100, pH 8.0 at 37 °C, and the oligonucleotides 

5 are covalently linked to microparticles . 

67. The composition of claim 66, wherein under said defined set of 
conditions, the degree of hybridization between each oligonucleotide and 
its complement varies by a factor of between 1 and up to 8, more 

10 preferably up to 7, more preferably up to 6, more preferably up to 5. 

68. The composition of any of claims 51 to 67, comprising one hundred 
and sixty said molecules, or comprising one hundred and seventy said 
molecules, or comprising one hundred and eighty said molecules, or 

15 comprising one hundred and ninety said molecules, or comprising two 
hundred said molecules, or comprising two hundred and twenty said 
molecules, or comprising two hundred and forty said molecules, or 
comprising two hundred and sixty said molecules, or comprising two 
hundred and eighty said molecules, or comprising three hundred said 

20 molecules, or comprising four hundred said molecules, or comprising five 
hundred said molecules, or comprising six hundred said molecules, or 
comprising seven hundred said molecules, or comprising eight hundred 
said molecules, or comprising nine hundred said molecules, or comprising 
one thousand said molecules. 

25 

69. A composition of any of claims 51 to 68, wherein each said molecule 
is linked to a solid phase support so as to be distinguishable from a 
mixture of other said molecules by hybridization to its complement. 

30 70. The composition of claim 69, wherein each molecule is linked to a 

defined location on a said solid phase support, the defined location for 
each said molecule being different than the defined location for 
different other said molecules. 

35 71. The composition of claim 70, wherein each said solid phase support 
is a microparticle and each said molecule is covalently to a different 
microparticle than each other different said molecule. 

72. A composition according to any of claims 1 to 71, wherein each said 
40 molecule comprises a tag complement. 
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73. A composition according to any of claims 1 to 72, wherein any base 
is substituted by an analogue thereof. 

74 . A composition according to any of claims 1 to 73 , wherein each said 
5 molecule comprises a tag complement. 

75. A kit for sorting and identifying polynucleotides, the kit 
comprising one or more solid phase supports each having one or more 
spatially discrete regions, each such region having a uniform population 

10 of substantially identical tag complements covalently attached, and the 
tag complements each being selected from the set of oligonucleotides as 
defined in any of claims 1 to 84 . 

76. A kit according to claim 75, wherein there is a tag complement for 
15 each said oligonucleotide of a said composition. 

77. A kit according to claim 75 or 76 wherein said one or more solid 
phase supports is a planar substrate and wherein said one or more 
spatially discrete regions is a plurality of spatially addressable 

20 . regions. 

78. A kit according to any of claims 75 to 77 wherein said one or more 
solid phase supports is a plurality of microparticles. 

25 79. A kit according to claim 78 wherein said microparticles each have a 
diameter in the range of from 5 to 40 um. 

80. A kit according to claim 78 or 79, wherein each microparticle is 
spectrophotometrically unique from each other microparticle having a 

30 different oligonucleotide attached thereto. 

81. A method of analyzing a biological sample comprising a biological 
sequence for the presence of a mutation or polymorphism at a locus of 
the nucleic acid, the method comprising: 

(A) amplifying the nucleic acid molecule in the presence of a first primer 
having a 5' -sequence having the sequence of a tag complementary to the 

' sequence of a tag complement belonging to a family of tag complements 
as defined in claim 74 to form an amplified molecule with a 5' -end with 
a sequence complementary to the sequence of the tag; 

(B) extending the amplified molecule in the presence of a polymerase and a 
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second primer having 5' -end complementary the 3' -end of the amplified 
sequence, with the 3 '-end of the second primer extending to immediately 
adjacent said locus, in the presence of a plurality of nucleoside 
triphosphate derivatives each of which is: (i) capable of 
incorporation during transciption by the polymerase onto the 3' -end of 
a growing nucleotide strand; (ii) causes termination of polymerization; 
and (iii) capable of differential detection, 'one from the other, 
wherein there is a said derivative complementary to each possible 
nucleotide present at said locus of the amplified sequence; 

(C) specifically hybridizing the .second primer to a tag complement having 
the tag complement sequence of (A) ; and 

(D) detecting the nucleotide derivative incorporated into the second primer 
in (B) so as to identify the base located at the locus of the nucleic 
acid. 



82. A method of analyzing a biological sample comprising a plurality of 
nucleic acid molecules for the presence of, a mutation or polymorphism at 
a locus of each nucleic acid molecule, for each nucleic acid molecule, 
5 the method comprising: 

(A) amplifying the nucleic acid molecule in the presence of a first primer 
having a 5' -sequence having the sequence of a tag complementary to the 
sequence of a tag complement belonging to a family of tag complements 
as defined in. claim 14 to form an amplified molecule with a 5'-end with 
a sequence complementary to the sequence of the tag; 

(B) extending the amplified, molecule in the presence of a polymerase and a 
second primer having 5' -end complementary the 3' -end of the amplified 
sequence, the 3' -end of the second primer extending to immediately 
adjacent said locus, in the presence of a plurality of nucleoside, 
triphosphate derivatives each of which is: (i) capable of 
incorporation during transciption by the polymerase onto the 3' -end of 
a growing nucleotide strand; (ii) causes termination of polymerization; 
and (iii) capable of differential detection, one from the other, 
wherein there is a said derivative complementary to each possible 
nucleotide present at said locus of the amplified molecule; 

(C) specifically hybridizing the second primer to a tag complement having 
the tag complement sequence of (A) ; and 

(D) detecting the nucleotide derivative incorporated into the. second primer 
in (B) so as to identify the base located at the locus of the nucleic 
acid; 
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wherein- each tag of (A) is unique for each nucleic acid molecule and steps 
(A) and (B) are carried out with said nucleic molecules in the presence of 
each other. 

83 . A method of analyzing a biological sample comprising a plurality of 
double, stranded complementary nucleic acid molecules for the presence of 
a mutation or polymorphism at a locus of each nucleic acid molecule, for 
each nucleic acid molecule, the method comprising: 

(A) amplifying the double stranded molecule in the presence of a pair of 
first primers, each primer having an identical 5' -sequence having the 
sequence of a tag complementary to the sequence of a tag complement 
belonging to a family of tag complements as defined in claim 74 to form 
amplified molecules with 5 ' -ends with a sequence complementary to the 
sequence of the tag; 

(B) extending the amplified molecules in the presence of a polymerase and a 
pair of second primers each second primer having a 5' -end complementary 
a 3' -end of the amplified sequence, the 3' -end of each said second 
primer extending to immediately adjacent said locus, in the presence of 
a plurality of nucleoside triphosphate derivatives each of which is: 

(i) capable of incorporation during transciption by the polymerase onto 
the 3' -end of a growing nucleotide strand; (ii) causes termination of 
polymerization; and (iii) capable of differential detection, one from 
the other; 

(C) specifically hybridizing each of the second primers to a tag complement 
having the tag complement sequence of (A) ; and 

(D) detecting the nucleotide derivative incorporated into the second 
primers in (B) so as to identify the base located at said locus; 

wherein the sequence of each tag of (A) is unique for each nucleic acid 
molecule and steps (A) and (B) are carried out with said nucleic molecules in 
the presence of each other. 

84. A method of analyzing a biological sample comprising a plurality of 
nucleic acid molecules for the presence of a mutation or polymorphism at 
a locus of each nucleic acid molecule, for each nucleic acid molecule, 
the method comprising: 

(a) hybridizing the molecule and a primer, the primer having a 5' -sequence 
having the sequence of a tag complementary to the sequence of a tag 
complement belonging to a family of tag complements as defined in claim 
74 and a 3' -end extending to immediately adjacent the locus; 

(b) enzymatically extending the 3' -end of the primer in the presence of a 
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plurality of nucleoside triphosphate derivatives each of which is: (i) 
capable of enzymatic incorporation onto the 3' -end of a growing 
nucleotide strand; (ii) causes termination of said extension; and (iii) 
capable of differential detection, one from the other, wherein there is 
a said derivative complementary to each possible nucleotide present at 
said locus ; 

(c) specifically hybridizing the extended primer formed in step (b) to a 
tag complement having the tag complement sequence of (a) ; and 

(d) detecting the nucleotide derivative incorporated into the primer in 
step (b) so as to identify the base located at the locus of the nucleic 
acid molecule; 

wherein each tag of (a) is unique for each nucleic acid molecule and steps 
(a) and (b) are carried out with said nucleic molecules in the presence of 
each other. 



85. The method of claim 82 wherein each said derivative is a dideoxy 
nucleoside triphosphate . 

5 86. The method of claim 84, wherein each respective complement is 

attached as a uniform population of substantially identical complements * 
in a spacially discrete region on one or more said solid phase supports. 

87. The method of claim 86, each said tag complement comprises a label, 
10 each such label being different for respective complements, and step (d) 

includes detecting the presence of 'the different labels for respective 
hybridization complexes of bound tags and tag complements. 

88. The hybridized molecule and primer of step (A) of any of claims 84 
15 to 87. 

89. A method of determining the presence of a target suspected of being 
contained in a mixture, the method comprising the steps of: 

(i) labelling the target with a first label ; 

(ii) providing a first detection moiety capable of specific binding to the 
target and including a first tag; 

(iii) exposing a sample of the mixture to the detection moiety under 
conditions suitable to permit (or cause) said specific binding of the 
molecule and target; 

(iv) providing a family of tag complements as defined in claim 74 wherein 
the family contains a first tag complement having a sequence 
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complementary to that of the first tag; 

(v) exposing the sample to the family of tag complements under conditions 
suitable to permit (or cause) specific hybridization of the first tag 
and its tag complement; 

(vi) determining whether a said first detection moiety hybridized to a first 
said tag complement is bound to a said labelled target in order to 
determine the presence or absence said target in the mixture. 

90. The method of claim 89 wherein said first tag complement is linked 
to a solid support at a specific location of the support and step (vi) 
includes detecting the presence the first label at said specified 

5 location. 

91. The method of claim 89 wherein said first tag complement comprises 
a second label and step (vi) includes detecting the presence of the 
first and second labels in a hybridized complex of the moiety and the 

10 first tag complement. 

92. The method of claim 89 wherein said target is selected from the 
group consisting of organic molecules, antigens, proteins, polypeptides, 
antibodies and nucleic acids . 

15 

93. The method of claim 92, wherein said target is an antigen and said 
first molecule is an antibody specific for said antigen. 

94. The method of claim 93, wherein the antigen is a polypeptide or 
20 protein and the labelling step includes conjugation of fluorescent 

molecules, digoxigenin, biotinylation and the like. 

95. The method of claim 94, wherein said target is a nucleic acid and 
the labelling step includes incorporation of fluorescent molecules, 

25 radiolabeled nucleotide, digoxigenin, biotinylation and the like. 
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